Win a copy of The Little Book of Impediments (e-book only) this week in the Agile and Other Processes forum!
  • Post Reply
  • Bookmark Topic Watch Topic
  • New Topic

Information Extraction

 
deepika deepi
Ranch Hand
Posts: 199
  • Mark post as helpful
  • send pies
  • Quote
  • Report post to moderator
hi experts,

i need some java coding for Information Extraction in Java for tourism or any personality name it should give me the details regarding it on the output screen
if i have an paragraph of documents like this

October 14, 2002, 4:00 a.m. PT
For years, Microsoft Corporation CEO Bill Gates
railed against the economic philosophy of opensource
software with Orwellian fervor, denouncing
its communal licensing as a "cancer" that stifled
technological innovation.
Today, Microsoft claims to "love" the open-source
concept, by which software code is made public to
encourage improvement and development by
outside programmers. Gates himself says Microsoft
will gladly disclose its crown jewels--the coveted
code behind the Windows operating system--to
select customers.
"We can be open source. We love the concept of
shared source," said Bill Veghte, a Microsoft VP.
"That's a super-important shift for us in terms of
code access.“
Richard Stallman, founder of the Free Software
Foundation, countered saying



AND IT SHOULD EXTRACT THE OUTPUT.Eg:

October 14, 2002, 4:00 a.m. PT
For years, Microsoft Corporation CEO Bill Gates
railed against the economic philosophy of opensource
software with Orwellian fervor, denouncing
its communal licensing as a "cancer" that stifled
technological innovation.
Today, Microsoft claims to "love" the open-source
concept, by which software code is made public to
encourage improvement and development by
outside programmers. Gates himself says Microsoft
will gladly disclose its crown jewels--the coveted
code behind the Windows operating system--to
select customers.
"We can be open source. We love the concept of
shared source," said Bill Veghte, a Microsoft VP.
"That's a super-important shift for us in terms of
code access.“
Richard Stallman, founder of the Free Software
Foundation, countered saying

then the output will be high lighting the output window on the screen

NAME TITLE ORGANIZATION
Bill Gates CEO Microsoft
Bill Veghte VP Microsoft
Richard Stallman founder Free Software


can any one help me out to get this output ?



 
Hardik Trivedi
Ranch Hand
Posts: 252
  • Mark post as helpful
  • send pies
  • Quote
  • Report post to moderator
Well you must have to maintain database for the keywords. Read the file sequentially, then you need to check for every keywords for every line.

It is bit of tough though. to form the information which you want to build. But as of now you can start using simple string manipulation and file reading.
 
  • Post Reply
  • Bookmark Topic Watch Topic
  • New Topic