Skip to content

Riounis/SparkAPITransformer

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

34 Commits
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

Part 0 Deliverables-

Sample program that alphabetizes data:

  • RDD API
  • Dataset API
  • Dataframe API

Algorithm:

  1. separate data by word
  2. aggregate arrays of words
  3. alphabetize array
  4. print out array

Sample program that returns the word with the highest count:

  • RDD API
  • Dataset API
  • Dataframe API

Algorithm:

  1. separate data by word
  2. map words to key value pairs where key = word and value = count
  3. aggregate key value pair counts
  4. print out word(s) with highest frequency

About

No description, website, or topics provided.

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Contributors 2

  •  
  •