Hi All,
In our application we are migrating huge volume of data from teradata to Hive.
Need to validate the data between source and target.We are planning to do it using python & pandas dataframe.
My queries are
1.Will pandas data-frame can handle around 15 million of data ?
2.Is there any other way to do it ?
What is the best way to achieve the above using python ?
Thanks in advance