hadoop - How to escape characters in pig -
i new pig , having trouble loading data pig. have data below:
example.txt
name,id,address,age basu,1,[73,dvg,india],24 nithi,2,[78,bang,india],24
in this, delimiter comma(,) , address field contains comma(,). need escape comma's in address field. please me this
thanks in advance basu
as per @basu comments if address can represented in "" instead of [] below approach can used.
pig script :
register piggybank.jar; student_details = load 'student.csv' using org.apache.pig.piggybank.storage.csvloader() (name:chararray,id:long,address:chararray, age:long); student_address = foreach student_details generate address; dump student_address;
input : student.csv
basu,1,"73,dvg,india",24 nithi,2,"78,bang,india",24
output : dump student_address
(73,dvg,india) (78,bang,india)
we can use either : org.apache.pig.piggybank.storage.csvexcelstorage() or org.apache.pig.piggybank.storage.csvloader().
refer below api links details
Comments
Post a Comment