maemo.org - Talk

maemo.org - Talk (https://talk.maemo.org/index.php)
-   General (https://talk.maemo.org/forumdisplay.php?f=7)
-   -   voice recognition to text (https://talk.maemo.org/showthread.php?t=4909)

kelkinny2004 2007-02-24 05:23

voice recognition to text
 
Hi, I don't have a N800 yet, but I have been watching since the 770 and am very fascinated by its potential.

I wondered if there is an application for converting voice, from the mic to text on the screen.

My elderly father is really hard of hearing, he does have hearing aids, but I never know if he really hears me. He can still read though<G>.

Is there an application for this device to translate my voice into text, through the microphone, for him to see. I figure I would face the N800 toward him.

thanks for any help

mwiktowy 2007-02-24 05:37

Re: voice recognition to text
 
Quote:

Originally Posted by kelkinny2004 (Post 36291)
Hi, I don't have a N800 yet, but I have been watching since the 770 and am very fascinated by its potential.

I wondered if there is an application for converting voice, from the mic to text on the screen.

My elderly father is really hard of hearing, he does have hearing aids, but I never know if he really hears me. He can still read though<G>.

Is there an application for this device to translate my voice into text, through the microphone, for him to see. I figure I would face the N800 toward him.

thanks for any help

While there is a text-to-speech engine (flite) available, what you want is the other way around ... a speech recognition app. I have not heard of one for the N800. Unfortunately, I would suspect that there won't be since speech recognition is pretty CPU intensive and the N800 isn't a huge number cruncher. You never know though. Someone might find an efficient enough algorithm that matches the capability of the N800 so I would never say never.

mwiktowy 2007-02-24 05:47

Re: voice recognition to text
 
Quote:

Originally Posted by mwiktowy (Post 36292)
Someone might find an efficient enough algorithm that matches the capability of the N800 so I would never say never.

Doing some digging around, it looks like there is something that would be appropriate is someone could do some integration work for the N800 (or maybe even the 770.

http://www.speech.cs.cmu.edu/pocketsphinx/

msaunby 2007-02-24 10:23

Re: voice recognition to text
 
It's important not to forget that the 770 and N800 are *Internet* tablets. In the first instance if I were developing something like this I'd go for creating a service I could connect to - think media server style. Much easier to do the development that way. It might even create a better product - you could speak into one 770 and your father could read on another.

It might be worth suggesting such a product/service to these folks - http://www.spinvox.com/

Karel Jansens 2007-02-24 12:01

Re: voice recognition to text
 
Quote:

Originally Posted by mwiktowy (Post 36292)
While there is a text-to-speech engine (flite) available, what you want is the other way around ... a speech recognition app. I have not heard of one for the N800. Unfortunately, I would suspect that there won't be since speech recognition is pretty CPU intensive and the N800 isn't a huge number cruncher. You never know though. Someone might find an efficient enough algorithm that matches the capability of the N800 so I would never say never.

IBM did it on my Pentium 75 with 64 MB of RAM, ten years ago, with a software-only solution. Granted, that was OS/2, so the rest of the world will probably have to wait another decade.

So, no: voice recognition is not that CPU-intensive (these days). It is, however, quite algorithm-intensive, which seems to be what is lacking in the postmodern world.

konfoo 2007-02-24 14:17

Re: voice recognition to text
 
Well I have Sphinx compiled.. now its just a matter of figuring this blasted thing out and pointing the 64Mb speech base lib to the mmc... more news soon :)

konfoo 2007-02-24 16:54

Re: voice recognition to text
 
Ok this sucker is more time-intensive than I am willing to spend. If anyone wants to help out post to this thread. We need a Sphinx-expert to configure the speech templates and a /dev/dsp pocketsphinx_continuous script.

mwiktowy 2007-02-24 18:32

Re: voice recognition to text
 
The section of that website that caught my attention was:

Quote:

You can also download telephone-bandwidth models separately. To use these with raw audio data you need the following extra command-line options:

-nfft 256
-nfilt 31
-lowerf 200
-upperf 3500
-samprate 8000
Since the 770 and N800 seem to capture audio at 8000 Hz sampling rate (based on the maemorecorder abilities), this voice model might be the way to go. Plus it is only 8 MB or so rather than the 25 MB that you speak of.

They are available here:
http://www.speech.cs.cmu.edu/pockets...linterp.tar.gz

Karel Jansens 2007-02-24 18:53

Re: voice recognition to text
 
This is what caught my eye on the Carnegie-Mellon site (http://cmusphinx.sourceforge.net/html/cmusphinx.php):
"Note however that Sphinx is not a final product. Those with a certain level of expertise can achieve great results with the versions of Sphinx available here, but a naive user will certainly need further help. In other words, the software available here is not meant for users with no experience in speech, but for expert users."

Aren't we in over our heads here?

(BTW, the phrase "users with no experience in speech" is kinda funny)

konfoo 2007-02-24 19:32

Re: voice recognition to text
 
http://www.kiecza.net/daniel/linux/c...ndex.html#toc5 another option but there are some compilation issues related to the mixer.

mwiktowy 2007-02-24 20:15

Re: voice recognition to text
 
Quote:

Originally Posted by Karel Jansens (Post 36340)
Aren't we in over our heads here?

(BTW, the phrase "users with no experience in speech" is kinda funny)

Spek 4 u self, I spek gud and spel gudder :]

Seriously though, I would imagine that is just referring to the fact that it doesn't have a nice shiny polished end-user GUI and a zillion and commandline one options with little documentation.

It seems like a decent baseline for someone capable to develop a nice shiny Maemo GUI on top of. This paired with existing flite and the existing text predicition would make for a really nice tool for those with communication impairments a la Stephen Hawking's gadget. Not a huge market segment but an important one.

Coincidentally, this demographic is one that has desires for open, DRM-unencumbered formats that are shared with the Open Source software community.

Karel Jansens 2007-02-24 21:11

Re: voice recognition to text
 
Quote:

Originally Posted by mwiktowy (Post 36346)
Spek 4 u self, I spek gud and spel gudder :]

Seriously though, I would imagine that is just referring to the fact that it doesn't have a nice shiny polished end-user GUI and a zillion and commandline one options with little documentation.

It seems like a decent baseline for someone capable to develop a nice shiny Maemo GUI on top of. This paired with existing flite and the existing text predicition would make for a really nice tool for those with communication impairments a la Stephen Hawking's gadget. Not a huge market segment but an important one.

Coincidentally, this demographic is one that has desires for open, DRM-unencumbered formats that are shared with the Open Source software community.

I've never been a big fan of speech recognition (handwriting recognition is so much more discrete :p ), but I can picture myself couch-potatoeing with my NaB00, shouting stuff at it.

So, you go dude. I'm sold. :cool:

geneven 2007-02-25 00:19

Re: voice recognition to text
 
I think that there isn't a decent speech to text app in Linux at all. The only decent current ones I know of are Dragon NaturallySpeaking and the one that comes wth Vista.These can be used by pros for text dictation; you may be able to find something you can shout commands at. IBM is no longer developing its TTS.

sd_proto 2008-02-12 23:59

Re: voice recognition to text
 
It looks like this hasn't been touched for almost a year. But... It would be nice to have the ability to have some speech activated commands from a bluetooth earpiece....

RipTorn 2008-02-13 12:10

Re: voice recognition to text
 
there is sphinxbase and pocketsphinx are part of the mud-builder for maemo.

I have installed sphinxbase but assume you need pocketsphinx to use it correctly, plus I'm Australian and sphinx in Linux has about as much chance of understanding me as much as me kicking a tin can down the allyway and understanding the morse code.

I haven't been able to get the pocketsphinx to compile under mud but I only tried for about 5 minutes. sphinxbase seemed to work correctly.

-Rip

hakytae 2010-03-30 19:54

Re: voice recognition to text
 
Can anyone send a method to install correctly pocketsphinx on a device like nokie n810 for example?

Thanks to the person who already did this and share that with us.

Flynx 2010-04-12 06:47

Re: voice recognition to text
 
Android got voice to text i guess with the recent update. I played with it today on my girlfiends droid and it ROCKS. Recognized words like supercalafragalisticexbialadocious.

Plus I like the implementation. The onscreen keyboard got a mic button you can use anywhere you can enter text.

I know, I know, android isnt *truely* open source. But neither are half our drivers. Google deserves some respect for the user experience they are delivering.


All times are GMT. The time now is 06:42.

vBulletin® Version 3.8.8