• The best choice for the company!

    Speed up and simplify communication with the devices around you.

  • Get the best solution for you!

    Take your call centers to unprecedented levels.

  • Let's get work done together.

    The future has arrived here, too. Become a part of it.

  • The best choice for the company!

    Work even when your eyes or hands are busy.

Main objectives and activities

  • The development of flexible text-to-speech synthesis (TTS) of high quality
  • The development of large vocabulary continuous automatic speech recognition (ASR)
  • The research and development of emotional speech recognition
  • The development of speech morphing systems
  • The development of natural language processing modules including dialogue management
  • The application of the developed speech technologies in Western Balkan countries:
    • in multimodal human-machine dialogue systems (IVR, smart phones, smart homes)
    • for purposes such as text reading, text dictation, and speech transcription
    • within aids for the physically disabled, visually impaired, speech impaired, hearing impaired.

The most important innovative results

News can be listened to on a number of speech-enabled websites (Radio Television of Serbia - RTS, Radio Television of Vojvodina - RTV, eUprava, and those of several municipalities) using a computer or a smartphone. Visually impaired users can have any text displayed on the screen read aloud by anReader, software based on AlfaNumTTS. The AlfaNumASR and AlfaNumTTS components have given smartphones basic speech generation and understanding capabilities in Serbian.

Further development of both large vocabulary ASR and more advanced TTS is based on the aforementioned speech and language resources. Both technologies will enable a much wider range of applications and will contribute to the preservation of Serbian and kindred languages in this new domain of communication – spoken dialogue between humans and machines.

Marketing is one of the most important elements of product placement, and a product can now be advertised across a multitude of electronic media. This makes it difficult for advertisers to monitor how their own campaigns are actually being run.

In the past, a client who paid for a particular audio recording to be broadcast on an electronic medium had no reliable and inexpensive way to verify how many times the recording had actually been broadcast, or whether it had been broadcast under the agreed conditions. Manual (human) monitoring of broadcasts was practically impossible, especially when a large number of media were involved. The client therefore had to trust that the recording was really broadcast as stipulated by the advertising contract. The media, in spite of all the equipment at their disposal, often fail to meet such obligations because of technical errors or for other reasons. Regardless of whether the error is accidental, the client is aggrieved.

Clients who need information about the broadcasts of their advertisements or other audio material can now receive reports based on the output of the automatic recognition system. The same capability can also be applied by agencies that monitor music broadcasts, such as copyright collecting agencies.

Speech

Speech is the basic means of communication between humans

Using speech, humans can convey their thoughts and feelings to others in a way much more intricate than in any other animal species, and thus the human speech system is the most complicated one...

ASR

Automatic Speech Recognition

Automatic speech recognition (ASR) is considered one of the greatest technical challenges of today, and it has attracted the attention of many researchers worldwide for more than half a century...

TTS

Text-to-Speech Synthesis

Text-to-Speech Synthesis (TTS) is the oldest speech technology, originating as early as the 18th century, when the first "speaking machines" appeared...