karenheckel/task1

Project Template - Top-K Hadoop Example.

Running on Laptop

Prerequisites:

  • Maven 3

  • JDK 1.6 or higher

  • (If working with Eclipse) Eclipse with the m2eclipse plugin installed

The Java main class is:

edu.cs.utexas.HadoopEx.WordCountTopKDriver

Input file: Book-Tiny.txt

Specify your own output directory, like output in the commands below.

Running:

Create a JAR Using Maven

To compile the project and create a single jar file with all dependencies:

mvn clean package

Run your application

Inside your shell with Hadoop

Running as Java Application:

java -jar target/topKHadoop-0.1-SNAPSHOT-jar-with-dependencies.jar SOME-Text-File.txt intermediatefolder output

For example: java -jar target/topKHadoop-0.1-SNAPSHOT-jar-with-dependencies.jar 20-news-same-line.txt intermediatefolder output

Or as a Hadoop application:

hadoop jar your-hadoop-application.jar edu.cs.utexas.HadoopEx.WordCountTopKDriver arg0 arg1 arg2
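
What the job computes

The driver source is not shown in this README, so the following is only a rough sketch of the idea its name suggests: count how often each word occurs, then keep the K most frequent. It is plain Java with no Hadoop dependency, and the class name TopKSketch and the methods countWords and topK are hypothetical names chosen for illustration, not part of this project. It assumes Java 8 or newer (lambdas and Map.merge). The three command-line arguments (input, intermediate folder, output) hint at a two-stage pipeline, which the two methods loosely mirror, but that mapping is an assumption.

```java
import java.util.ArrayList;
import java.util.HashMap;
import java.util.List;
import java.util.Map;
import java.util.PriorityQueue;

// Hypothetical illustration only; not the project's WordCountTopKDriver.
public class TopKSketch {

    // Stage 1 (assumed): count each lower-cased, whitespace-separated token.
    public static Map<String, Integer> countWords(String text) {
        Map<String, Integer> counts = new HashMap<>();
        for (String token : text.toLowerCase().split("\\s+")) {
            if (!token.isEmpty()) {
                counts.merge(token, 1, Integer::sum);
            }
        }
        return counts;
    }

    // Stage 2 (assumed): keep the k largest counts with a min-heap of size k.
    // The smallest count sits at the head, so it is evicted first.
    public static List<Map.Entry<String, Integer>> topK(Map<String, Integer> counts, int k) {
        PriorityQueue<Map.Entry<String, Integer>> heap =
            new PriorityQueue<>((a, b) -> Integer.compare(a.getValue(), b.getValue()));
        for (Map.Entry<String, Integer> entry : counts.entrySet()) {
            heap.offer(entry);
            if (heap.size() > k) {
                heap.poll();
            }
        }
        // Return the survivors ordered from most to least frequent.
        List<Map.Entry<String, Integer>> result = new ArrayList<>(heap);
        result.sort((a, b) -> Integer.compare(b.getValue(), a.getValue()));
        return result;
    }

    public static void main(String[] args) {
        Map<String, Integer> counts = countWords("the cat and the dog and the bird");
        for (Map.Entry<String, Integer> entry : topK(counts, 2)) {
            System.out.println(entry.getKey() + "\t" + entry.getValue());
        }
    }
}
```

The bounded heap keeps memory proportional to K rather than to the vocabulary size, which is the usual reason to prefer it over sorting every word. Words with equal counts may come out in any order.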
