#### NoSQL and Database

  • The raw data consists of four input files from Lahman’s Baseball Database, which contains complete batting and pitching statistics from 1871 to 2019.

  • Independently wrote a MapReduce program, a Spark program, and a MongoDB program to find all player/team/year triples such that the player hit more home runs than the team in that year. Saved the output to HDFS.
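
The query described above can be sketched in plain Python to show the logic shared by the three implementations. This is only an illustration and not the code in this repository. It assumes that a player's home runs are summed across all stints in a year and compared against every team's total for that same year. The function name `find_triples` and the tuple layouts are hypothetical, and the sample rows are made up.

```python
from collections import defaultdict


def find_triples(batting_rows, team_rows):
    """Return sorted (player_id, team_id, year) triples where the player
    out-homered the team in that year.

    batting_rows: iterable of (player_id, year, home_runs), one row per stint
    team_rows:    iterable of (team_id, year, home_runs)
    (Both layouts are assumptions made for this sketch.)
    """
    # "Map" step: total each player's home runs per year across stints.
    player_hr = defaultdict(int)
    for player_id, year, hr in batting_rows:
        player_hr[(year, player_id)] += hr

    # Group team totals by year so the join key is the year alone.
    teams_by_year = defaultdict(list)
    for team_id, year, hr in team_rows:
        teams_by_year[year].append((team_id, hr))

    # "Reduce" step: within each year, compare every player with every team.
    triples = []
    for (year, player_id), hr in player_hr.items():
        for team_id, team_hr in teams_by_year.get(year, []):
            if hr > team_hr:
                triples.append((player_id, team_id, year))
    return sorted(triples)


# Made-up sample rows, not taken from the Lahman files.
batting = [("p1", 2000, 30), ("p1", 2000, 25), ("p2", 2000, 5)]
teams = [("A", 2000, 50), ("B", 2000, 60), ("A", 2001, 10)]
result = find_triples(batting, teams)
```

With the sample rows, `p1` totals 55 home runs in 2000, which exceeds team `A` (50) but not team `B` (60), so only that one triple is returned. Keying both inputs by year mirrors a reduce-side join in MapReduce or a join on the year column in Spark.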