This is the Ruby Capstone Project of Microverse, in which students must complete a real-world-like project within 72 hours according to these project specifications.
It's a 3-in-1 web scraper that lets users parse all courses from udacity.com and jobs from indeed.com and remote.io into CSV files.
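As a rough illustration of the "into CSV files" step, here is a minimal sketch that turns a list of scraped records into CSV text using Ruby's standard `csv` library. The `rows_to_csv` and `write_csv` helpers and the sample rows are hypothetical and are not taken from this repo; the actual scrapers may structure their data differently.

```ruby
require 'csv'

# Made-up rows standing in for scraped results (illustration only).
SAMPLE_JOBS = [
  { title: 'Ruby Developer', company: 'Acme', location: 'Remote' },
  { title: 'Backend Engineer', company: 'Globex', location: 'Berlin' }
].freeze

# Hypothetical helper: serialize an array of hashes to CSV text,
# using the keys of the first hash as the header row.
def rows_to_csv(rows)
  return '' if rows.empty?

  headers = rows.first.keys
  CSV.generate do |csv|
    csv << headers.map(&:to_s)
    rows.each { |row| csv << row.values_at(*headers) }
  end
end

# Hypothetical helper: write the CSV text to a file at the given path.
def write_csv(path, rows)
  File.write(path, rows_to_csv(rows))
end
```

Calling `write_csv('indeed_jobs.csv', SAMPLE_JOBS)` would create a file with one header line followed by one line per record.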
Job Scraped from Remote.io
Job Scraped from indeed.com
Courses Scraped from udacity.com

- Ruby
- Nokogiri gem
- HTTParty gem
├── README.md
├── bin
│   └── main.rb
├── lib
│   ├── scraper.rb
│   ├── udacity_scraper.rb
│   ├── indeed_scraper.rb
│   └── remoteio_scraper.rb
└── rspec
    ├── scraper_spec.rb
    └── spec_helper.rb
Feel free to check out this link for a 3-minute video walkthrough :)
- Clone this repo and `cd` into the `web_scraper` directory.
- Run `bundle install` on the command line to install the Nokogiri and HTTParty gems.
- Run `bin/main.rb`.
- Enter either 'udacity', 'indeed', or 'remote.io' and follow the respective prompts.
- Tada! 'udacity_courses.csv', 'indeed_jobs.csv', or 'remote_io.csv' will be created in the root directory, respectively :)
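The mapping between the three accepted inputs and the files they produce can be sketched as a simple lookup. This is only an illustration under assumptions: `OUTPUT_FILES` and `output_file_for` are hypothetical names, and the real `bin/main.rb` may handle input differently (for example, it may not trim whitespace or ignore case as this sketch does).

```ruby
# Accepted inputs and the CSV file each one produces, as listed in the steps above.
OUTPUT_FILES = {
  'udacity' => 'udacity_courses.csv',
  'indeed' => 'indeed_jobs.csv',
  'remote.io' => 'remote_io.csv'
}.freeze

# Hypothetical helper: return the output file name for a user's input,
# or nil when the input is not one of the three supported sites.
# Trimming and downcasing are assumptions made for this sketch.
def output_file_for(input)
  OUTPUT_FILES[input.to_s.strip.downcase]
end
```

A lookup table keeps the list of supported sites in one place, so adding a fourth scraper would only need one new entry.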
- Clone this repo and `cd` into the `web_scraper` directory.
- Install rspec with `gem install rspec`.
- Run `rspec` on the command line.
- You will see failures because the 3 scraped files haven't been created yet.
- To fix this, run `ruby bin/main.rb` three times, entering 'udacity', 'indeed', and 'remote.io' in turn.
- Run `rspec` again. The test cases will pass once each file has been created :)
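Since the specs depend on the generated files being present, a quick check like the one below can show which CSV files are still missing before running `rspec`. It is a minimal sketch using only Ruby's standard library; `EXPECTED_FILES` and `missing_files` are hypothetical names, not part of this repo, and the check assumes the files live in the directory you pass in (the repo root by default).

```ruby
# The three files the scrapers are expected to create, per the steps above.
EXPECTED_FILES = %w[udacity_courses.csv indeed_jobs.csv remote_io.csv].freeze

# Hypothetical helper: list the expected CSV files not yet present in `dir`.
def missing_files(dir = Dir.pwd)
  EXPECTED_FILES.reject { |name| File.exist?(File.join(dir, name)) }
end
```

An empty result from `missing_files` means all three scrapers have been run at least once.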
👤 Kyle Law
Contributions, issues and feature requests are welcome!
Feel free to check the issues page.
Give a ⭐️ if you like this project!
- Microverse
- Nokogiri gem
- HTTParty Parser
- Udacity.com
- Indeed.com
- Remote.io
This project is MIT licensed.



