Speech recognition
In computer science and electrical engineering, speech recognition (SR) is the translation of spoken words into text. It is also known as "automatic speech recognition" (ASR), "computer speech recogni...
Google Video Looks At ‘Science Of Talking With Computers’
Language. Easy for humans to understand (most of the time), but not so easy for computers. This is a short film about speech recognition, language understanding, neural nets, and using our voices to c...
Kinect - Microsoft Research turns Kinect into sign language reader
A new piece of software allows Kinect to read Chinese Sign Language gestures using the device’s hand-tracking technology, then quickly translate it into text.
Kinect
Noticiario semanal de la comunidad bluebox presentado por Alex . Fuentes: 3djuegos / HOBBY CONSOLAS / Meristation Assasin's crees Revelation : http://www.you...
Speech recognition software
In computer science and electrical engineering, speech recognition (SR) is the translation of spoken words into text. It is also known as "automatic speech recognition" (ASR), "computer speech recogni...
Janus Recognition Toolkit (JRTk)
Janus Recognition Toolkit (JRTk), sometimes referred to as Janus, is a general purpose speech recognition toolkit developed and maintained by the Interactive Systems Laboratories at Carnegie Mellon Un...
Real-time transcription
Real-time transcription is the general term for transcription by court reporters using real-time text technologies to deliver computer text screens within a few seconds of the words being spoken. Spec...
Fluency Voice Technology
Fluency Voice Technology was a company that developed and sold packaged speech recognition solutions for use in call centers. Fluency’s Speech Recognition solutions are used by call centers worldwide ...
Sensory, Inc.
Sensory, Inc. is a Santa Clara based company which develops and makes speech technologies on both hardware (Integrated Circuit - IC or "chip") and software platforms for consumer products, offering IC...
Kinect - Microsoft Research turns Kinect into sign language reader
A new piece of software allows Kinect to read Chinese Sign Language gestures using the device’s hand-tracking technology, then quickly translate it into text.
VoiceBox Technologies
VoiceBox Technologies is a company focused on conversational speech recognition, search and information management. The company licenses its technology to makers of telematics, digital home, mobile ph...
Nokia 5250
The Nokia 5250 is a budget Nokia resistive touchscreen smartphone running on Symbian v9.4 operating system with a S60 5th Edition user interface. Its price before tax and subsidies is €115. It was ann...
Logogen model
The logogen model of 1969 is a model of speech recognition that uses units called "logogens" to explain how humans comprehend spoken or written words. Logogens are a vast number of specialized recogni...
Telephonetics
Telephonetics VIP was a software company that develops speech recognition and voice automation solutions. Telephonetics VIP is a subsidiary of Telephonetics plc which listed on the AIM Market (TPH) on...
List of speech recognition software

The following list presents notable speech recognition software engines with a brief synopsis of characteristics.The following lists open-source applications that provide convenient user interfac...
Subvocal recognition
Subvocal recognition (SVR) is the process of taking subvocalization and converting the detected results to a digital output aurally or text-based.
A set of electrodes are attached to the skin of t...
Subvocal recognition - Wikipedia
Acoustic Model
An acoustic model is used in Automatic Speech Recognition to represent the relationship between an audio signal and the phonemes or other linguistic units that make up speech. The model is learned fro...
RWTH ASR
RWTH ASR (short RASR) is a proprietary speech recognition toolkit.The toolkit includes newly developed speech recognition technology for the development of automatic speech recognition systems. It has...
Lexical Markup Framework
ISO 24613:2008, Language resource management - Lexical markup framework (LMF), is the ISO International Organization for Standardization ISO/TC37 standard for natural language processing (NLP) and mac...
Lexical Markup Framework - Wikipedia
Keyword spotting
Keyword spotting is a subfield of speech recognition that deals with the identification of keywords in utterances.There are several types of keyword spotting:Keyword spotting in unconstrained speech a...
Dragon NaturallySpeaking
Dragon NaturallySpeaking (also known as Dragon for PC, or DNS), is a speech recognition software package developed by Dragon Systems of Newton, Massachusetts, and later acquired by Nuance Communicati...
Dragon NaturallySpeaking - Wikipedia
Spectral modeling synthesis
Spectral modeling synthesis or simply SMS is an acoustic modeling approach for speech and other signals.SMS considers sounds as a combination of harmonic content and noise content. Harmonic component...
Time-inhomogeneous hidden Bernoulli model
Time-inhomogeneous hidden Bernoulli model (TI-HBM) is an alternative to hidden Markov model (HMM) for automatic speech recognition. Contrary to HMM, the state transition process in TI-HBM is not a Mar...
Time-inhomogeneous hidden Bernoulli model - Wikipedia
Voice Navigator
The Voice Navigator was the first voice recognition device for command and control of a graphical user interface (Patent no. 5377303). The system was originally designed for the Apple Macintosh Plus a...
Plum Voice
The Plum Group, Inc. (DBA Plum Voice) is a company that provides interactive voice response platforms, systems and hosting services to developers and companies to automate call center and business pro...
Plum Voice - Wikipedia
SpeechWorks
SpeechWorks was a company founded in the mid-1990s in Boston that developed and supported speech-related computer software. The company was purchased in mid-2003 by Peabody, Massachusetts-based Nuance...
Vocapia Research
Vocapia Research, formerly Vecsys Research, is a high tech research and development company (R&D), developing technologies for multilingual, unconstrained speech-to-text transcription systems, au...
Vocapia Research - Wikipedia
VoxForge
VoxForge is a free speech corpus and acoustic model repository for open source speech recognition engines.VoxForge was set up to collect transcribed speech to create a free GPL speech corpus for use w...
N-gram
In the fields of computational linguistics and probability, an n-gram is a contiguous sequence of n items from a given sequence of text or speech. The items can be phonemes, syllables, letters, words ...
Stenomask
A stenomask is a hand-held microphone built into a padded, sound-proof enclosure that fits over the speaker's mouth or nose and mouth. Some lightweight versions may be fitted with an elastic neck str...
Stenomask - Wikipedia