• Post Reply Bookmark Topic Watch Topic
  • New Topic
programming forums Java Mobile Certification Databases Caching Books Engineering Micro Controllers OS Languages Paradigms IDEs Build Tools Frameworks Application Servers Open Source This Site Careers Other all forums
this forum made possible by our volunteer staff, including ...
Marshals:
  • Campbell Ritchie
  • Paul Clapham
  • Ron McLeod
  • Bear Bibeault
  • Liutauras Vilda
Sheriffs:
  • Jeanne Boyarsky
  • Junilu Lacar
  • Henry Wong
Saloon Keepers:
  • Tim Moores
  • Stephan van Hulst
  • Jj Roberts
  • Tim Holloway
  • Piet Souris
Bartenders:
  • Himai Minh
  • Carey Brown
  • salvin francis

Interview question on how to convert unstructured data into structured data

 
Ranch Foreman
Posts: 2037
12
  • Mark post as helpful
  • send pies
  • Quote
  • Report post to moderator
I was asked a question during interview that how is unstructured data converted into structured data. I said unstructured data needs further parsing but that doesnt seem to be the accurate reply to his question. I think there is no fixed answer to this question. How can unstructured data be converted into structured data. Thanks
 
Marshal
Posts: 71028
291
  • Mark post as helpful
  • send pies
  • Quote
  • Report post to moderator
When they asked you that question, did you ask them for further explanation?
 
Monica Shiralkar
Ranch Foreman
Posts: 2037
12
  • Mark post as helpful
  • send pies
  • Quote
  • Report post to moderator

Thanks. No. I said that for processing unstructured data we require further breaking it down by parsing.
 
Campbell Ritchie
Marshal
Posts: 71028
291
  • Mark post as helpful
  • send pies
  • Quote
  • Report post to moderator
You should ask for more information before answering.
 
Monica Shiralkar
Ranch Foreman
Posts: 2037
12
  • Mark post as helpful
  • send pies
  • Quote
  • Report post to moderator

Campbell Ritchie wrote:You should ask for more information before answering.



Thanks. Yes.

In general how is unstructured data converted into structured  data?
 
Sheriff
Posts: 15994
265
Mac Android IntelliJ IDE Eclipse IDE Spring Debian Java Ubuntu Linux
  • Likes 2
  • Mark post as helpful
  • send pies
  • Quote
  • Report post to moderator
I don't think your strategy of asking the same question over and over is going to work. The point of the answers you've gotten so far is that there's not enough context in the question to give an intelligent answer. I think that was also the point of the interviewer in asking the question, to see if you could intelligently dig for more context and not just say the first thing that popped into your head.
 
Monica Shiralkar
Ranch Foreman
Posts: 2037
12
  • Mark post as helpful
  • send pies
  • Quote
  • Report post to moderator

Junilu Lacar wrote:I think that was also the point of the interviewer in asking the question, to see if you could intelligently dig for more context and not just say the first thing that popped into your head.



Thanks.
 
Ranch Foreman
Posts: 28
3
  • Mark post as helpful
  • send pies
  • Quote
  • Report post to moderator
Here are a few questions you could ask:

  • What kind of data is it e.g. binary, text, what encoding, etc?
  • Where does it come from e.g.files, streams, etc?
  • Does it have a known structure e.g. a schema?
  • If it has a schema, is that schema consistent, or does each record have a different structure?
  • What kind of structured data do you want to produce - CSV, SQL, JSON, Avro, XML etc?


  • And so on...
     
    Consider Paul's rocket mass heater.
    reply
      Bookmark Topic Watch Topic
    • New Topic