Archive

Posts Tagged ‘Input’

Managed .NET Speech API links

(this is my answer at http://stackoverflow.com/questions/14771474/voice-recognition-in-windows)

I’m looking into adding speech recognition to my fork of Hotspotizer Kinect-based app (http://github.com/birbilis/hotspotizer)

After some search I see you can’t markup the actionable UI elements with related speech commands in order to simulate user actions on them as one would expect if Speech input was integrated in WPF. I’m thinking of making a XAML markup extension to do that, unless someone can point to pre-existing work on this that I could reuse…

The following links should be useful:

http://www.wpf-tutorial.com/audio-video/speech-recognition-making-wpf-listen/

http://www.c-sharpcorner.com/uploadfile/mahesh/programming-speech-in-wpf-speech-recognition/

http://blogs.msdn.com/b/rlucero/archive/2012/01/17/speech-recognition-exploring-grammar-based-recognition.aspx

https://msdn.microsoft.com/en-us/library/hh855387.aspx (make use of Kinect mic array audio input)

http://kin-educate.blogspot.gr/2012/06/speech-recognition-for-kinect-easy-way.html

https://channel9.msdn.com/Series/KinectQuickstart/Audio-Fundamentals

https://msdn.microsoft.com/en-us/library/hh855359.aspx?f=255&MSPPError=-2147217396#Software_Requirements

https://www.microsoft.com/en-us/download/details.aspx?id=27225

https://www.microsoft.com/en-us/download/details.aspx?id=27226

http://www.redmondpie.com/speech-recognition-in-a-c-wpf-application/

http://www.codeproject.com/Articles/55383/A-WPF-Voice-Commanded-Database-Management-Applicat

http://www.codeproject.com/Articles/483347/Speech-recognition-speech-to-text-text-to-speech-a

http://www.c-sharpcorner.com/uploadfile/nipuntomar/speech-to-text-in-wpf/

http://www.w3.org/TR/speech-grammar/

https://msdn.microsoft.com/en-us/library/hh361625(v=office.14).aspx

https://msdn.microsoft.com/en-us/library/hh323806.aspx

https://msdn.microsoft.com/en-us/library/system.speech.recognition.speechrecognitionengine.requestrecognizerupdate.aspx

http://blogs.msdn.com/b/rlucero/archive/2012/02/03/speech-recognition-using-multiple-grammars-to-improve-recognition.aspx

Kinect for Xbox 360 and Kinect for Windows (KfW) v1 specs

Picture

JJ131033.k4w_sensor_2(en-us,IEB.10).png

picture

1) 3D Depth sensor (IR Emitter + IR Camera / Depth Sensor)

2) RGB camera (Color Sensor)

3) Microphone array

4) Tilt motor (for detecting floor and players in the playspace)

 

Kinect Specifications
Viewing angle Field of View (FoV): 43° vertical x 57° horizontal
Vertical tilt range ±27°
Frame rate (depth and color stream) 30 frames per second (FPS)
Audio format 16-kHz, 24-bit mono
pulse code modulation (PCM)
Audio input characteristics 4-microphone array
24-bit analog-to-digital converter (ADC)
onboard signal processing (including acoustic echo cancellation & noise suppression)
Accelerometer characteristics 2G/4G/8G accelerometer configured for 2G range
1° accuracy detail limit
(can help detect when the sensor is in an unusual orientation)

 

Sources:

https://en.wikipedia.org/wiki/Kinect

https://support.xbox.com/en-US/xbox-360/kinect/kinect-sensor-components

https://msdn.microsoft.com/en-us/library/jj131033.aspx

%d bloggers like this: