• The best choice for the company!
    The best choice for the company!

    Speed up and ease the communication with devices around you.

  • Get the best solution for you!
    Get the best solution for you!

    Upgrade the call centers to unseen levels.

  • Let's get work done together.
    Let's get work done together.

    Future arrived here, too. Become a part of it.

  • The best choice for the company!
    The best choice for the company!

    Work even when your eyes or hands are busy.

  • Get the best solution for you!
    Get the best solution for you!

    Future arrived here, too. Become a part of it.

Main objectives and activities

  • The development of flexible text-to-speech synthesis (TTS) of high quality
  • The development of large vocabulary continuous automatic speech recognition (ASR)
  • The research and development of emotion speech recognition
  • The development of speech morphing systems
  • The development of natural language processing modules including dialogue management
  • The application of the developed speech technologies in Western Balkan countries:
    • in multimodal human-machine dialogue systems (IVR, smart phones, smart homes)
    • for purposes such as: text reading, text dictation, speech transcription
    • within aids for the physically disabled, visually impaired, speech impaired, hearing impaired.

The most important innovative results

One can listen to news at a number of speech-enabled web sites (Radio Television of Serbia - RTS, Radio Television of Vojvodina - RTV, eUprava, as well as several municipalities) using a computer or a smart phone. The visually impaired can listen to any text displayed on the screen using the software anReader based on AlfaNumTTS. The AlfaNumASR and AlfaNumTTS components have provided smart phones with basic speech generation and understanding functionalities in Serbian.

Further development of both large vocabulary ASR and more advanced TTS is based on the aforementioned speech and language resources. Both technologies will enable a much wider range of applications and will contribute to the preservation of Serbian and kindred languages in this new domain of communication – spoken dialogue between humans and machines.


Our team’s continuous work on raising accuracy of automatic speech recognition has led to significant results. The error rate of speech recognition has been brought down to under 10% on a dictionary of more than 100,000 words for English and Serbian.

This technology has many applications on the market, among which are IVRs and call centers, and one of the latest examples is the speech recognition by a robot at Croatian Telekom.

Published 20.07.2017.

robot nao


Speech is the basic means of communication between humans

Using speech, humans can convey their thoughts and feelings to others in a way much more intricate than in any other animal species, and thus the human speech system is the most complicated one...

Read more


Automatic Speech Recognition

Automatic speech recognition (ASR) is considered one of the greatest technical challenges of today, attracting attention of many researchers worldwide for more than half a century...

Read more


Text-to-Speech Synthesis

Text-to-Speech Synthesis (TTS) is the oldest speech technology, originating from as early as the 18th century, when first "speaking machines" appeared...

Read more