Win a copy of Programmer's Guide to Java SE 8 Oracle Certified Associate (OCA) this week in the OCAJP forum!
  • Post Reply
  • Bookmark Topic Watch Topic
  • New Topic

JSAPI - other languages than English; MIDlet - settings loading; JSAPI - algorithm application

 
Johny Wyenski
Greenhorn
Posts: 21
  • Mark post as helpful
  • send pies
  • Quote
  • Report post to moderator
Hello!

I need to create application which uses speech recognition. At first I thought about using CMU Sphinx (PocketSphinx or, possibly, Sphinx4). Later I thought about JSAPI. (Sphinx somehow uses JSAPI but I don't know what is the difference between JSAPI and Sphinx).

I have read almost all (up to 6.7.9) of the following tutorial: http://java.sun.com/products/java-media/speech/forDevelopers/jsapi-guide.pdf. Unfortunately I couldn't've found one important thing, i.e. how to create acoustic model for other language than English?

Thanks in advance for your answers :-)!

PS There are some other things which I'd like to know:
1) How to load some settings from file (I guess nowadays configuration files are created with the use of XML but I dunno)?
2) How to maintain algorithm which is used by MIDlet which involves JSAPI? I mean there are some different things which my MIDlet needs to do. I guess it is good habit to divide different goals into separate parts of code (due to object-oriented programming).
In my case there are some different things:
a) speech recognition of audio input, i.e. changing input audio stream into output text string
b) analysis of that text string and according to this string choosing the proper transition in my algorithm
In general I have written my algorithm on sheet of paper and it takes about ten A4 sheets of paper. Because of it I thought there should be some way to write this algoritm maybe outside the code, in some kind of file which would contain this algorithm. Maybe there is other good way to implement this algorithm, not necesarilly in the code.
c) sending of results through httpconnection with the use of POST method
d) receiving in on TomCat on server
3) Which method should I use to receive the recognized speech? I found these:
a) FinalRuleResult, b) Result -> getBestToken, c) getSpokenText, d) ResultToken of RuleGrammar
4) Can you give me any full examples of JSAPI usage? (Not just short parts of code like in this JSAPI guide)?

Greetins :-)!
 
  • Post Reply
  • Bookmark Topic Watch Topic
  • New Topic