I want to update and delete some records in Pig, and I want to know how to achieve that.
I want to update the value of the record with ID = 3 and delete the record with ID = 5, so that my expected table will have records like:
How can I achieve the above result?
Post by: chris webster
Assuming your data is in files on HDFS, my understanding is that you cannot really do arbitrary in-place updates like you would with SQL in a relational database. You would probably need to read the data and modify the relevant records before writing it all back to HDFS. If you're using Hive or HBase to store your data, then maybe there are other options available, but in-place updates are not really what Hadoop is intended for.
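To make the read, modify and write-back approach concrete, here is a minimal Pig Latin sketch. The original table is not shown in the thread, so the paths, the comma delimiter, the two-column schema (id, value) and the replacement string 'new_value' are all assumptions for illustration only. The "delete" is just a FILTER that drops ID 5, and the "update" is a FOREACH that rewrites the value for ID 3 and passes every other row through unchanged.

```pig
-- Hypothetical input path, delimiter and schema; adjust to match your data.
records = LOAD 'input/records' USING PigStorage(',') AS (id:int, value:chararray);

-- "Delete": keep every record except the one with id 5.
-- Rows whose id is null are also dropped, because the comparison yields null.
kept = FILTER records BY id != 5;

-- "Update": replace the value for id 3, leave all other rows as they are.
-- 'new_value' is a placeholder for whatever the updated value should be.
updated = FOREACH kept GENERATE id, (id == 3 ? 'new_value' : value) AS value;

-- Write the result to a new (hypothetical) location rather than over the input.
STORE updated INTO 'output/records_updated' USING PigStorage(',');
```

Pig refuses to STORE into a directory that already exists, so the result goes to a new path. If you need it under the original path, you would swap the directories afterwards as a separate step, for example with HDFS file system commands.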