• The best choice for the company!
    The best choice for the company!

    Speed up and ease the communication with devices around you.

  • Get the best solution for you!
    Get the best solution for you!

    Upgrade the call centers to unseen levels.

  • Let's get work done together.
    Let's get work done together.

    Future arrived here, too. Become a part of it.

  • The best choice for the company!
    The best choice for the company!

    Work even when your eyes or hands are busy.

  • Get the best solution for you!
    Get the best solution for you!

    Future arrived here, too. Become a part of it.

Main objectives and activities

  • The development of flexible text-to-speech synthesis (TTS) of high quality
  • The development of large vocabulary continuous automatic speech recognition (ASR)
  • The research and development of emotion speech recognition
  • The development of speech morphing systems
  • The development of natural language processing modules including dialogue management
  • The application of the developed speech technologies in Western Balkan countries:
    • in multimodal human-machine dialogue systems (IVR, smart phones, smart homes)
    • for purposes such as: text reading, text dictation, speech transcription
    • within aids for the physically disabled, visually impaired, speech impaired, hearing impaired.

The most important innovative results

One can listen to news at a number of speech-enabled web sites (Radio Television of Serbia - RTS, Radio Television of Vojvodina - RTV, eUprava, as well as several municipalities) using a computer or a smart phone. The visually impaired can listen to any text displayed on the screen using the software anReader based on AlfaNumTTS. The AlfaNumASR and AlfaNumTTS components have provided smart phones with basic speech generation and understanding functionalities in Serbian.

Further development of both large vocabulary ASR and more advanced TTS is based on the aforementioned speech and language resources. Both technologies will enable a much wider range of applications and will contribute to the preservation of Serbian and kindred languages in this new domain of communication – spoken dialogue between humans and machines.

 

Financier: Ministry of Labour and Social Care

Project duration: 2006 – 2007

Project partners:

  • The Association of the visually impaired of Serbia
  • Faculty of Technical Sciences, University of Novi Sad
  • AlfaNum, Novi Sad
Summary: Computers and speech software have become an important tool for the visually impaired, because speech software can now convert each text in Serbian into speech. The Audio library for the visually impaired (ABSS) is based on anReader, Serbian text-to-speech synthesizer. ABSS is a simple solution for the visually impaired. It allows them to search the whole library using various criteria (title, author, year of publication, place of publication, keywords, etc.). The library is easily extendable (with permission from the copyright holder), and one book can be simultaneously accessed by multiple users. Books are automatically converted into synthesized speech and as such can be burned onto CDs, to be listened to on regular CD players. Within this project ABSS was installed within the library system of The Association of the Visually Impaired of Serbia, and client applications, providing access to the ABSS over the Internet, were distributed to all members of the Association.

Speech

Speech is the basic means of communication between humans

Using speech, humans can convey their thoughts and feelings to others in a way much more intricate than in any other animal species, and thus the human speech system is the most complicated one...

Read more

ASR

Automatic Speech Recognition

Automatic speech recognition (ASR) is considered one of the greatest technical challenges of today, attracting attention of many researchers worldwide for more than half a century...

Read more

TTS

Text-to-Speech Synthesis

Text-to-Speech Synthesis (TTS) is the oldest speech technology, originating from as early as the 18th century, when first "speaking machines" appeared...

Read more

TTS Demo

TTS demo

ASR Demo

ASR demo


AlfaNum d.o.o.

Krajiška 41, 21132 Petrovaradin

Prodaja: +381 64 0550184
Podrška TTS: +381 66 8095758
Podrška ASR: +381 66 8096324
Administracija: +381 66 8094467