I have one input text file, around 800MB in size, which contains both uppercase words and lowercase words. I am reading the file and storing all the uppercase words in a vector (obviously this vector is big; let us assume 700MB).
My problem is that removing the duplicates from the vector and sorting it is very slow because of its size.
It takes far too long to process. If I try with a small file, everything is OK.
Once the sort and duplicate removal succeed, I need to write this large vector to a flat file.
But the whole process takes far too long.
So please suggest the best way to handle this scenario.
Thanks in advance.