About the project:
The goal of this project is to create and improve automatic speech recognition (also known as speech-to-text) technology for Norwegian and Swedish languages. The newly developed technology should operate with high accuracy in various applications including online broadcast monitoring (TV, radio, internet podcasts, etc.), transcription of speeches in parliaments and similar public institutions, as well as spoken archive mining.
In the first year we have created a speech-to-text system for Norwegian, with the main challenge being the two co-existing written standards of Norwegian: Bokmål and Nynorsk. We have also improved the robustness of our system to environmental noise and cross-talk.
In the second year, we added Swedish to our existing Norwegian speech recognition. Recent advances in machine learning have allowed us to apply the so-called end-to-end models for both languages, which, although requiring large amounts of data for training, significantly outperform the older models.
Beneficiary and project partners:
NEWTON Technologies, a.s.
Norges teknisk-naturvitenskapelige universitet (NTNU)
Technická univerzita v Liberci
The NORDTRANS project benefits from a € 1.2 mil. grant from Norway Grants and Technology Agency of the Czech Republic. The project is carried out under the KAPPA funding programme for applied research, experimental development and innovation, managed by the Technology Agency of the Czech Republic.