maemo.org - Talk


McLightning 2010-07-09 09:13

Any Success in speech recognition on Maemo?
 
Hi everybody,
I have been searching for speech recognition software that works on Maemo.
I have seen a lot of topics about this, but none of them had a solution or any sign that someone had succeeded.

I want to use speech recognition in an N810-controlled robot project.
It would be great to have such a feature.

Regards.

pawpawyoung 2010-07-09 10:09

Re: Any Success in speech recognition on Maemo?
 
Hi, buddy. I once ported Sphinx to the N800, and of course it should work on Maemo 5 as well.

pfontana77 2010-07-09 10:27

Re: Any Success in speech recognition on Maemo?
 
Hello pawpawyoung,
Do you think it would be possible to use Sphinx as a component for voice dialing on the N900?

I see that this was considered in a brainstorm, but I don't see any follow-up.
http://maemo.org/community/brainstor...oice_commands/

McLightning 2010-07-09 19:38

Re: Any Success in speech recognition on Maemo?
 
No, actually I have an N810 too.
Can you share how to install it?
Please, please, I'm begging you.

radiochickenwax 2010-07-09 19:50

Re: Any Success in speech recognition on Maemo?
 
Aren't there many other threads on this topic already? We really need to get more focused.

Pocketsphinx works on the N900; see http://0xab.com/?p=93

You need to enable the repository following the instructions there, and then you're in for some effort to make it work. You need to build your own UI, which isn't too hard but is time-consuming. AFAIK, nobody has packaged up a nice GUI yet.

I'd really like to be able to plug pocketsphinx into Emacs for dictation and command execution. You could send SMS and call contacts easily this way. http://sumoudou.org/%E7%9B%B8%E6%92%...U%20Emacs.html The easiest way I can think of to do this is a language model that simulates a keyboard, but I haven't kludged this up yet, nor seen anyone else who has.
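
A minimal sketch of the "simulated keyboard" idea, assuming the recognizer hands over one word at a time and that xdotool is available to inject key presses. The vocabulary and the function names (`xdotool_args`, `press`) are hypothetical and only illustrate the mapping step; they are not taken from any existing tool.

```python
import subprocess

# Hypothetical vocabulary: spoken word -> xdotool sub-command and argument.
KEY_COMMANDS = {
    "copy": ["key", "ctrl+c"],
    "paste": ["key", "ctrl+v"],
    "enter": ["key", "Return"],
}


def xdotool_args(word):
    """Translate one recognized word into an xdotool argument list."""
    word = word.strip().lower()
    if word in KEY_COMMANDS:
        return ["xdotool"] + KEY_COMMANDS[word]
    if len(word) == 1 and word.isalnum():
        # Single letters and digits are typed literally.
        return ["xdotool", "type", word]
    return None  # unknown word: ignore it


def press(word, dry_run=True):
    """Return the command for a word and run it unless dry_run is set."""
    args = xdotool_args(word)
    if args is not None and not dry_run:
        subprocess.call(args)
    return args
```

With `dry_run=True` the function only returns the command it would run, which makes it easy to check the mapping before letting it type into a real window.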

Pocketsphinx also works well on n8xx from what I've seen. http://www.youtube.com/watch?v=OEUeJb6Pwt4 ...but this is still far from the usability of Google's cloud-based speech recognition. It's nice that you don't have to send everything over the web, though, and it's much more open and configurable if you can put in the time and effort.

karimko 2010-07-24 12:32

Re: Any Success in speech recognition on Maemo?
 
Hello,

I've had a go with the repository on 0xab.com, but couldn't get things to work.
* Here's what I tried: sudo apt-get install chicken-qt
I got a problem with libqt4-maemo5-dev.
* I tried to install the sphinx packages found here (http://0xab.com/n900/repository.html) under voice recognition.
I also had problems with dependencies...

Could anyone give us a step-by-step guide? Getting voice recognition up and running would be cool! Perhaps it's a problem with PR1.2?

I have worked with Sphinx4 in Java, but only under Windows.

Anyway, any help would be appreciated.

Thanks,

kinipyon 2010-07-25 10:50

Re: Any Success in speech recognition on Maemo?
 
Quote:

Originally Posted by radiochickenwax (Post 746433)
Pocketsphinx also works well on n8xx from what I've seen. http://www.youtube.com/watch?v=OEUeJb6Pwt4 ...

I hacked this to N900. Works quite nice.

If I have more time maybe I will implement some basic voice dialing, shouldn't be that hard.

If anyone wants to try it, follow http://0xab.com/?p=93. Please note that you need to change alsasrc to pulsesrc.

karimko 2010-07-25 10:58

Re: Any Success in speech recognition on Maemo?
 
Quote:

Originally Posted by kinipyon (Post 764018)
I hacked this to N900. Works quite nice.

If I have more time maybe I will implement some basic voice dialing, shouldn't be that hard.

If anyone wants to try it, follow http://0xab.com/?p=93. Please note that you need to change alsasrc to pulsesrc.

kinipyon,

Great news! Could you tell us how to get this working on the n900, perhaps a small step by step? I have had trouble with dependencies and packages, and would like to experiment too :)

Any help would surely be appreciated.

Thanks in advance :)

imperiallight 2010-07-25 14:56

Re: Any Success in speech recognition on Maemo?
 
Have you seen this voice-activated browser?

http://www.youtube.com/watch?v=gjIUv84U_E0

Wikiwide 2011-01-13 02:52

Re: Any Success in speech recognition on Maemo?
 
Quote:

Originally Posted by karimko (Post 764023)
kinipyon,

Great news! Could you tell us how to get this working on the n900, perhaps a small step by step? I have had trouble with dependencies and packages, and would like to experiment too :)

Any help would surely be appreciated.

Thanks in advance :)

http://code.google.com/p/voximp/
Download voximp.tar.gz to any location (MyDocs is fine) and unpack it (I did it in Filebox).

Install through apt-get:

gstreamer0.10-pocketsphinx
pocketsphinx-hmm-en-hub4wsj
python-pocketsphinx

(I suppose this is the bare minimum.)
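
For anyone who prefers to script the install step, here is a rough sketch. It assumes a repository that carries these three packages is already enabled and that the script runs as root; the helper names are made up for illustration.

```python
import subprocess

# Package names exactly as listed in this post.
PACKAGES = [
    "gstreamer0.10-pocketsphinx",
    "pocketsphinx-hmm-en-hub4wsj",
    "python-pocketsphinx",
]


def install_command(packages=PACKAGES):
    """Build the apt-get command line as an argument list."""
    return ["apt-get", "install", "-y"] + list(packages)


def install(dry_run=True):
    """Show the command, or run it when dry_run is False (needs root)."""
    cmd = install_command()
    if dry_run:
        return " ".join(cmd)
    return subprocess.call(cmd)
```

Calling `install()` with the default `dry_run=True` only returns the command string, so nothing is installed until you ask for it explicitly.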

In voximp.py change alsasrc to pulsesrc
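
If you would rather not edit the file by hand, the same change can be sketched as a plain text substitution. This assumes the string alsasrc appears in voximp.py only where the audio source is named; the function names are hypothetical, and a backup copy is written first in case the assumption is wrong.

```python
def use_pulsesrc(source_text):
    """Replace every occurrence of alsasrc with pulsesrc."""
    return source_text.replace("alsasrc", "pulsesrc")


def patch_file(path):
    """Patch a file in place, keeping the original as path + '.bak'.

    Returns True if the file was changed, False if there was nothing to do.
    """
    with open(path) as handle:
        original = handle.read()
    patched = use_pulsesrc(original)
    if patched == original:
        return False
    with open(path + ".bak", "w") as handle:
        handle.write(original)
    with open(path, "w") as handle:
        handle.write(patched)
    return True
```

For example, `patch_file("/path/to/voximp.py")` would rewrite the script and leave `/path/to/voximp.py.bak` next to it.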

Run in X Terminal:
python /path/to/voximp.py

It will ask you to copy or move three files (included in the voximp.tar.gz you unpacked earlier; one is named something like 9****.dic) to /home/user/.config/voximp/
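
The copy step could look roughly like this. The list of files is left to the caller, because the exact file names are whatever voximp asks for; `install_config` is a hypothetical helper, and the default target assumes the home directory is /home/user, as on the N900.

```python
import os
import shutil


def install_config(files, config_dir=os.path.expanduser("~/.config/voximp")):
    """Copy the given files into the voximp config directory.

    Creates the directory if it does not exist and returns the new paths.
    """
    if not os.path.isdir(config_dir):
        os.makedirs(config_dir)
    copied = []
    for path in files:
        target = os.path.join(config_dir, os.path.basename(path))
        shutil.copy(path, target)
        copied.append(target)
    return copied
```

Pass it the paths of the three files from the unpacked archive, in whatever directory you unpacked them.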

Once most problems are solved, running in X Terminal
python /path/to/voximp.py
will give you
letter X
command X
/bin/sh: xdotool not found

This means that it listens and recognises, but has no way to pass the result to the operating system as a key press.
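
To check for the missing program before starting voximp, a small PATH search is enough. This is only a sketch; `find_program` is a hypothetical helper written by hand so that it does not depend on a recent Python version.

```python
import os


def find_program(name, path=None):
    """Return the full path of an executable found on PATH, or None."""
    if path is None:
        path = os.environ.get("PATH", "")
    for directory in path.split(os.pathsep):
        candidate = os.path.join(directory, name)
        # Accept only regular files that the current user may execute.
        if os.path.isfile(candidate) and os.access(candidate, os.X_OK):
            return candidate
    return None


if __name__ == "__main__":
    location = find_program("xdotool")
    if location is None:
        print("xdotool not found on PATH; key presses cannot be sent")
    else:
        print("xdotool found at " + location)
```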

I took xdotool from Debian, built for the armel architecture:
http://packages.debian.org/stable/x11/xdotool
Here it is (just the binary):
http://sharesend.com/info/uqr8g
Works for me.
Of course, you could compile it from source and publish it to Maemo Extras; you are welcome to! I'm lazy.

Now, running in X Terminal

python /path/to/voximp.py &

will give you, besides long output in X Terminal, the ability to type letters and execute commands (cut, copy and paste; I don't know what else) in any program.

Warning: the ' &' at the end of the line means that voximp will keep working after you close X Terminal.

If you later use X Terminal (to kill voximp, for instance) while voximp is still running, you will get spaces and random letters typed into it (how many depends on the level of noise around you).

Good luck! I'm going to test voximp later. It works but it's kind of unusual.

EDIT:
I still cannot open Fennec by voice command.
The problem is that it can recognise Fennec as a partial (if it's pronounced aloud by espeak), but after several partials, the final command ISN'T Fennec. It can be T or some other letter.

I don't have ideal testing conditions, though. I have a loud computer keyboard, sand in the N900 speakers, and dust in the N900 microphone.
So it's not surprising that sphinx rarely gives correct recognition.

The question is: how do partials work, and why was the 'Fennec' partial swallowed instead of being used as the final command?
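
One possible explanation, offered as an assumption rather than something checked against the voximp source: as far as I know, the GStreamer pocketsphinx element reports a partial result while an utterance is still being decoded and a separate final result when the utterance ends. A partial is only the best guess so far, so later audio (or noise) can replace it before the final result is produced. The sketch below models that behaviour with a hypothetical `CommandListener` class whose two methods stand in for the two callbacks; only the final text is ever acted on.

```python
class CommandListener(object):
    """Toy model of a listener that acts only on final results."""

    def __init__(self, vocabulary):
        self.vocabulary = set(word.lower() for word in vocabulary)
        self.partials = []   # provisional guesses for the current utterance
        self.executed = []   # commands that were actually accepted

    def on_partial(self, text):
        # A partial is a provisional guess; record it but do not act on it.
        self.partials.append(text)

    def on_final(self, text):
        # The utterance is over: forget the partials and judge the final text.
        self.partials = []
        word = text.strip().lower()
        if word in self.vocabulary:
            self.executed.append(word)
            return word
        return None
```

In this model, a run of partials such as "fennec" followed by "t" ends with nothing executed if the final text is "t", which matches the behaviour described above.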

EDIT: I have read the .dic file. The pronunciation (mine or the .dic's) is awful. But at least it sometimes recognises Fennec (as a final) when I say it. I have also stopped printing partials. And:
If you want to use voximp to enter text, you need to find a full corpus of your language somewhere, unless you are going to spell everything letter by letter.

As for Fennec, I don't know what command could open a new tab in an existing window.


All times are GMT.