Interview Questions   Tutorials   Discussions   Programs   Videos   

Pig - Word Count program in PIG?

asked marvit January 26, 2015 10:53 PM  

Word Count program in PIG?


3 Answers

answered By marvit   0  

-- Load input from the file named Mary, and call the single

-- field in the record 'line'.

input = load 'mary' as (line);

-- TOKENIZE splits the line into a field for each word.

-- flatten will take the collection of records returned by

-- TOKENIZE and produce a separate record for each one, calling the single

-- field in the record word.

words = foreach input generate flatten(TOKENIZE(line)) as word;

-- Now group them together by each word.

grpd = group words by word;

-- Count them.

cntd = foreach grpd generate group, COUNT(words);

-- Print out the results.

dump cntd;

   add comment

answered By marvit   0  

A = load '/home/accure/shamshad/input.txt';

B = foreach A generate flatten(TOKENIZE((chararray)$0)) as word;

C = group B by word;

D = foreach C generate COUNT(B), group; store D into './wordcount';



To run this pig script:

pig -x local /home/accure/shamshad/accure.pig

   add comment

answered By Experts-976   0  

Word Count program in PIG?

text= LOAD ‘text.txt’ USING TextLoader();
tokens=FOREACH text GENERATE FLATTEN (TOKENIZE($0)) as word;
wordcount = FOREACH (GROUP tokens BY word) GENERATE group as word, COUNT_STAR($1) as ct;
   add comment

Your answer

Join with account you already have



 Write A Tutorials
Online-Classroom Classes

  1 person following this question

  1 person following this tag

  Question tags

pig × 1

Asked 1 year and 10 months ago ago
Number of Views -355
Number of Answers -3
Last updated
1 year and 8 months ago ago

  Similar questions

Ready to start your tutorial with us? That's great! Send us an email and we will get back to you as soon as possible!