hadoop - How to escape characters in Pig


I am new to Pig and am having trouble loading data into Pig. I have data like the following:

example.txt

name,id,address,age
basu,1,[73,dvg,india],24
nithi,2,[78,bang,india],24

Here the delimiter is a comma (,), but the address field also contains commas. I need to escape the commas in the address field. Please help me with this.

Thanks in advance,
Basu

As per @basu's comments, if the address can be represented in "" instead of [], the approach below can be used.

Pig script:

REGISTER piggybank.jar;

student_details = LOAD 'student.csv'
    USING org.apache.pig.piggybank.storage.CSVLoader()
    AS (name:chararray, id:long, address:chararray, age:long);

student_address = FOREACH student_details GENERATE address;
DUMP student_address;

Input: student.csv

basu,1,"73,dvg,india",24
nithi,2,"78,bang,india",24

Output: DUMP student_address

(73,dvg,india)
(78,bang,india)

We can use either org.apache.pig.piggybank.storage.CSVExcelStorage() or org.apache.pig.piggybank.storage.CSVLoader().
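As a minimal sketch, the same load can be written with CSVExcelStorage, the other loader named above. It assumes the same piggybank.jar, the same quoted student.csv, and the no-argument constructor with its default comma delimiter. It has not been checked against a specific Pig release.

```pig
-- Sketch: same input as above, loaded with CSVExcelStorage instead of CSVLoader.
REGISTER piggybank.jar;

student_details = LOAD 'student.csv'
    USING org.apache.pig.piggybank.storage.CSVExcelStorage()
    AS (name:chararray, id:long, address:chararray, age:long);

-- The quoted address should arrive as a single chararray field, commas included.
student_address = FOREACH student_details GENERATE address;
DUMP student_address;
```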

Refer to the API links below for details:

  1. http://pig.apache.org/docs/r0.12.0/api/org/apache/pig/piggybank/storage/CSVExcelStorage.html
  2. http://pig.apache.org/docs/r0.9.1/api/org/apache/pig/piggybank/storage/CSVLoader.html
