Skip to content

Scott-Huang/Academic_Search

Repository files navigation

Academic Engine

Draft work for establishing an academic search engine for users with no prior knowledge about the searching concept to better locate papers they are looking for and provide a general elaboration of the problem they are dealing with.


Environment

  • python==3.7

Used Packages

  • Whoosh
  • SpaCy
  • Gensim
  • Flask

Data

The dataset is a mirror of the original ArXiv data, which can be downloaded in Kaggle.

The computer science keywords set can be downloaded here.


Usage

The concept_search.ipynb notebook generates an engine storing the keywords appeared in each abstract, and provides a ranking function to display the search results.

The context_search.ipynb notebook is a draft work trying to provide an explanation of the relation between a query and searched concepts.

The word2vec.ipynb notebook generates a word and phrase embedding model.

After finishing all precomputations to set up the draft web engine, run app.py, inside the folder web_engine, to set up the testing server of the search engine.


A short demo

Here is a short demo for the interface of the engine.

About

Draft work for building an search engine that helps users to locate needed papers without prior knowledge in relavant field

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

 
 
 

Contributors