• Post Reply Bookmark Topic Watch Topic
  • New Topic
programming forums Java Mobile Certification Databases Caching Books Engineering Micro Controllers OS Languages Paradigms IDEs Build Tools Frameworks Application Servers Open Source This Site Careers Other Pie Elite all forums
this forum made possible by our volunteer staff, including ...
Marshals:
  • Campbell Ritchie
  • Ron McLeod
  • Junilu Lacar
  • Paul Clapham
  • Liutauras Vilda
Sheriffs:
  • Jeanne Boyarsky
  • Rob Spoor
  • Bear Bibeault
Saloon Keepers:
  • Tim Moores
  • Tim Holloway
  • Piet Souris
  • Carey Brown
  • Stephan van Hulst
Bartenders:
  • Frits Walraven
  • fred rosenberger
  • salvin francis

Python for Data Analysis

 
Bartender
Posts: 962
  • Likes 1
  • Mark post as helpful
  • send pies
    Number of slices to send:
    Optional 'thank-you' note:
  • Quote
  • Report post to moderator
Author/s    : Wes McKinney
Publisher   : O'Reilly Media
Category   : Other
Review by : Deepak Bala
Rating        : 7 horseshoes

I managed to catch an early release version of this book. It bears resemblance to another book from Oreilly - 'Mining the social web', (MTSW) although the examples in this book are not restricted to social web sites.

There are a couple of python libraries that the author takes you through before getting you to work on examples. The chapters on these libraries serve as references when you want to revisit technique X on data structure Y. The illustrations include analyzing bit.ly data / twitter tweets etc. To use any of these libraries you MUST know python.

What I like:

* Chapters explaining library functions serve as a good reference to come back to later.
* The author has taken time to point out pitfalls while using libraries. This includes tips on how to use them practically (Say reading a huge CSV file in patches and aggregating the result)
* Pragmatic examples based on data out there USDA food database / baby names

Areas for improvement:

* I really wish the illustrations and library usage were married together. The pandas library needs getting used to for a newcomer. When you read 10 pages of instructions on how to use the library, you forget what you read on page-1. It would have been much easier had the author explained them side by side.

* I don't think the introduction to python was necessary towards the end of the book. It seemed like a weird place to put it. If you wanted to introduce users to python before they use pandas and numpy, you'd rather want to put that as the first chapter and tell those who already know python to skip it. IMHO if you do not know python, following some of the examples will be harder since you need to learn python AND libraries like pandas.


Overall a nice book. I would recommend readers to read this book and then follow it up with MTSW. I dont think the author of MTSW used pandas on any of the use cases, so using the knowledge learned here should help (I've not read MTSW yet and searching the book for 'pandas' threw up no results)

More info at Amazon.com
 
You showed up just in time for the waffles! And this tiny ad:
Building a Better World in your Backyard by Paul Wheaton and Shawn Klassen-Koop
https://coderanch.com/wiki/718759/books/Building-World-Backyard-Paul-Wheaton
reply
    Bookmark Topic Watch Topic
  • New Topic