Keywords-Forward

Author: Nachiket Subbaraman
Using a corpus of 1.5 million computer science research documents from Kaggle and an Oxford dictionary of common computer science phrases, I built a Python server that indexes and stores these documents with Whoosh. A second Whoosh index stores one document per computer science phrase: the phrase is the title, and the content is a dictionary that maps the title's related keywords to a numerical score for how closely each one relates to the title. Users enter a computer science phrase in the HTML frontend; the server uses Whoosh to search the titles of the second index for the query, retrieves the matching title's content, and displays the dictionary in the browser.
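To make the shape of the second index concrete, here is a minimal sketch of the phrase-to-keywords lookup. It uses a plain Python dictionary as a stand-in for the Whoosh index, so it is not the code in `whoosh_one_mil.py`. The sample phrases, the scores, and the names `KEYWORD_INDEX`, `normalize`, and `lookup_keywords` are all assumptions made for illustration.

```python
# Stand-in for the second index: title phrase -> {related keyword: score}.
# The phrases and scores below are made-up sample data, not project output.
KEYWORD_INDEX = {
    "machine learning": {
        "regression": 0.64,
        "neural network": 0.91,
        "supervised learning": 0.87,
    },
    "operating system": {
        "scheduler": 0.78,
        "kernel": 0.93,
    },
}


def normalize(phrase):
    """Lowercase the phrase and collapse runs of whitespace."""
    return " ".join(phrase.lower().split())


def lookup_keywords(query, index=KEYWORD_INDEX):
    """Return (keyword, score) pairs for the query, highest score first.

    An unknown phrase yields an empty list rather than an error.
    """
    related = index.get(normalize(query), {})
    return sorted(related.items(), key=lambda item: item[1], reverse=True)


if __name__ == "__main__":
    for keyword, score in lookup_keywords("  Machine   Learning "):
        print(f"{score:.2f}  {keyword}")
```

In a Whoosh version of this sketch, the title would presumably be an indexed field and the dictionary a stored field returned with each hit; the sorting step would stay the same.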

Files in this repository:
README.md
whoosh_keywords_index.html
whoosh_one_mil.py
