![]() ![]() ![]() The number in the parentheses next to each dataset, e.g. All systems are scored only on the utterances with predictions given by all systems. Table 4: Results (%WER) for 3 systems evaluated on the original audio. Benchmarks from Gigaom are encouraging as shown in the table below, but I am not aware of any good wrapper around to make it usable without quite some coding (and a large training data set): On Microsoft Windows I use Dragon NaturallySpeaking, on Apple Mac OS X I use Apple Dictation and DragonDictate, on Android I use Google speech recognition, and on iOS I use the built-in Apple speech recognition.īaidu Research released yesterday the code for its speech recognition library using Connectionist Temporal Classification implemented with Torch. As for Wine + Dragon NaturallySpeaking, in my experience it keeps crashing, and I don't seem to be the only one to have such issues unfortunately. By poor accuracy, I mean an accuracy significantly below the one the speech recognition software I mentioned below for other platforms have. Wine + Dragon NaturallySpeaking + NatLink + dragonfly + damselflyĪll the above-mentioned native Linux solutions have both poor accuracy and usability (or some don't allow free-text dictation but only voice commands).silvius (built on the Kaldi speech recognition toolkit).IBM ViaVoice (used to run on Linux but was discontinued years ago).I have unsatisfyingly tried the following: It should not be restricted to voice commands, as I want to be able to dictate text. The short version of the question: I am looking for a speech recognition software that runs on Linux and has decent accuracy and usability. ![]()
0 Comments
Leave a Reply. |
AuthorWrite something about yourself. No need to be fancy, just an overview. ArchivesCategories |