GitHub - vikram087/matsearch

Materials Science Paper Search Engine

This project currently has a single maintainer, so please anticipate some wait time for bug fixes and updates.

Overview

This project is a comprehensive search engine for materials science papers. It combines advanced search capabilities with natural language processing (NLP) for an efficient, user-friendly way to explore scientific literature in the materials science field.

Key Features

Vector Search: Uses the HNSW (Hierarchical Navigable Small World) algorithm for efficient similarity-based searches.
Advanced Sorting & Filtering: Sort results by relevance or date and filter by criteria like date range, and search terms.
Flexible Pagination: Customizable results per page for a smooth browsing experience.
Natural Language Processing: Sentence Transformer for embedding queries and MatBERT for extracting key properties from abstracts.
Caching for Speed: Redis caching optimizes performance, achieving a 91% improvement in fetching speed for cached results.
Fuzzy Search Capabilities: Enhances author and category searches, increasing search flexibility.

Architecture

Backend Stack

Python: Core scripting language.
Flask: Backend API framework.
Elasticsearch: Database for paper metadata and embeddings.
ArXiv API: Data source for paper metadata.
Sentence Transformer: Embeds query for similarity search.
Redis: Caching for high performance.
MatBERT: NER model for property extraction.

Frontend Stack

React.js: Library for a responsive and intuitive user interface.

Deployment

Docker: Containerization and local hosting of Elasticsearch.

Setup

⚡ Getting Started
Follow the README files below for manual setup instructions:

🐳 Docker Supported!

Docker Setup

Contact Information

For questions or bug reports, please contact the lead developer:

Lead Developer: Vikram Penumarti
Email: [email protected]

Contribution

This project is currently maintained by a single developer. If there is enough community interest, contributions may be opened to outside collaborators. For inquiries or to express interest in contributing, please reach out to [email protected].

Name		Name	Last commit message	Last commit date
Latest commit History 249 Commits
backend		backend
docker		docker
frontend		frontend
kubernetes		kubernetes
testing		testing
.gitignore		.gitignore
LICENSE		LICENSE
README.md		README.md

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Materials Science Paper Search Engine

Overview

Key Features

Architecture

Backend Stack

Frontend Stack

Deployment

Setup

Contact Information

Contribution

About

Uh oh!

Releases

Packages

Uh oh!

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

Materials Science Paper Search Engine

Overview

Key Features

Architecture

Backend Stack

Frontend Stack

Deployment

Setup

Contact Information

Contribution

About

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Uh oh!

Contributors

Uh oh!

Languages

Packages