[VoIP] More fun with speech recognition

Russ Price kxt at fubegra.net
Wed Apr 9 21:25:38 CDT 2008


I've been playing around with another speech recognition system, based 
on the CMU Sphinx2 engine.  A handy little package with an AGI 
application and a Perl server program connects Asterisk to Sphinx.

Advantages:
* Does not rely on an outside server that could disappear or hide behind 
a paywall at any time
* Software is free

Disadvantages:
* Very difficult to set up
* Fantastically complicated recognition engine with documentation best 
suited for computer speech gurus (not I!)
* Speech recognition from a carbon mic (like my trusty WECO 2500) is 
spotty at best - condenser mics are much better
* Likely to have problems with UK English - or even with southern US accents

I'm running the speech software on one of my Ubuntu MythTV systems, 
since it has a very fast CPU (Core 2 Duo E8400), and Ubuntu has 
precompiled sphinx2 packages.

The SPREC package that makes Sphinx2 usable with Asterisk is at 
<http://uc.org/read/sprec>.  The CMU Sphinx programs themselves are at 
<http://cmusphinx.sourceforge.net/html/cmusphinx.php>.

I'm definitely *not* going to do any handholding on how to set this all 
up.  It's highly experimental and very subject to tweaking, especially 
if I can somehow figure out a way to make it work better with carbon 
transmitters.  It is *not* a drop-in replacement for app_asr in any way, 
shape, or form.

Oddly, my North Electric 6H6 performs slightly better than the 2500 with 
this setup - but my Uniden cordless phone works best.  At any rate, I 
found that a yes/no confirmation step was absolutely vital because of 
the recognition problems, and I run separate recognizers for digits and 
for yes/no questions.
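
For anyone curious what that confirmation step looks like in outline, 
here is a minimal sketch in Python.  It is not the SPREC code (that is an 
AGI application plus a Perl server), and every name in it is a 
hypothetical stand-in: the two recognizer callables, the prompts, and the 
retry count are assumptions made up for illustration.  The only idea 
taken from the setup above is that digits and yes/no answers go to 
separate recognizers, and a number is accepted only after a "yes".

```python
from typing import Callable, Optional

# Hypothetical recognizer type: takes a prompt to play, returns the
# recognized text, or None if nothing usable was heard.
Recognizer = Callable[[str], Optional[str]]


def collect_confirmed_number(
    recognize_digits: Recognizer,
    recognize_yes_no: Recognizer,
    max_attempts: int = 3,  # assumed retry limit, not from SPREC
) -> Optional[str]:
    """Ask for a number, read it back, and accept it only after a 'yes'."""
    for _ in range(max_attempts):
        heard = recognize_digits("Number, please?")
        if not heard:
            continue  # nothing recognized; ask again

        # Keep only the digits, dropping spaces or other filler.
        digits = "".join(ch for ch in heard if ch.isdigit())
        if not digits:
            continue

        # Read the digits back and ask the separate yes/no recognizer.
        prompt = f"I heard {' '.join(digits)}. Is that right?"
        answer = recognize_yes_no(prompt)
        if answer is not None and answer.strip().lower() == "yes":
            return digits

    return None  # caller gave up or was never understood
```

Keeping the yes/no recognizer separate means the confirmation only has 
to tell two words apart, which is a much easier job than recognizing a 
whole string of digits.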

I've set up a CNET robo-operator at +1 442 9901 if you want to test it 
out.  It expects the full CNET number including the country code.

I've noticed that the LumenVox engine is now available for Asterisk, 
with a "starter kit" for $50 - and prices that go up steeply for greater 
capabilities.  <http://www.lumenvox.com/partners/digium/asterisk.aspx>

	Russ
	+1 442 7877

