Self-Learning AI Summarization System

Overview

This project is a self-learning AI system that scrapes educational content from open educational resources, summarizes the content, evaluates the quality of summaries, and continuously improves its summarization capabilities through training.

Features

Web Scraping: Fetches articles from educational websites like MIT OpenCourseWare, Open UMN, and OER Commons
Content Processing: Handles both PDF and HTML content
Text Summarization: Creates concise summaries using extractive techniques
Quality Evaluation: Assesses summary quality based on multiple metrics
Self-Improvement: Learns from high-quality summaries to improve future results
Visualization: Tracks improvement metrics over time

System Components

Scraper: Fetches and extracts content from educational websites
Cleaner: Preprocesses and cleans raw text
Summarizer: Generates concise summaries from text
Evaluator: Assesses the quality of generated summaries
Trainer: Learns from high-quality examples to improve future summaries
Pipeline: Orchestrates the entire process

Setup

Clone this repository
Run the setup script to create the Python environment and install dependencies:

.\setup.ps1

Start the Next.js development server:

npm run dev

Usage

Access the web interface at http://localhost:3000
View AI improvement metrics at http://localhost:3000/ai-improvement
Use the API endpoints for programmatic access:
- /api/summarize: Generate a summary for provided text
- /api/train: Submit a training pair (text and summary)
- /api/metrics: Get system performance metrics

Directory Structure

/self_learning_ai: Core Python modules for the AI system
/app: Next.js web application
/data: Storage for articles, summaries, and training data
- /raw: Raw scraped content
- /summaries: Generated summaries
- /fine_tune: Training pairs
- /reports: Performance visualizations

License

MIT

Name		Name	Last commit message	Last commit date
Latest commit History 21 Commits
.vscode		.vscode
app		app
components		components
data		data
hooks		hooks
lib		lib
public		public
self_learning_ai		self_learning_ai
styles		styles
.eslintrc.json		.eslintrc.json
.gitignore		.gitignore
.vercelignore		.vercelignore
CUsersyk214.DESKTOP-5OI0MOPOneDriveMasaüstüniviskarv0datatempupdate_homepage.py		CUsersyk214.DESKTOP-5OI0MOPOneDriveMasaüstüniviskarv0datatempupdate_homepage.py
FEATURE_UPDATE_SUMMARY.md		FEATURE_UPDATE_SUMMARY.md
README.md		README.md
cleanup_temp.bat		cleanup_temp.bat
components.json		components.json
desktop.ini		desktop.ini
download_nltk_data.py		download_nltk_data.py
fix_dependencies.py		fix_dependencies.py
fix_system.bat		fix_system.bat
install_python_deps.py		install_python_deps.py
next.config.mjs		next.config.mjs
package-lock.json		package-lock.json
package.json		package.json
pnpm-lock.yaml		pnpm-lock.yaml
postcss.config.mjs		postcss.config.mjs
run_all_tests.py		run_all_tests.py
setup.ps1		setup.ps1
tailwind.config.ts		tailwind.config.ts
test_api.py		test_api.py
test_environment.py		test_environment.py
test_imports.py		test_imports.py
test_new_features.py		test_new_features.py
test_pipeline.py		test_pipeline.py
tsconfig.json		tsconfig.json

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Self-Learning AI Summarization System

Overview

Features

System Components

Setup

Usage

Directory Structure

License

About

Uh oh!

Releases

Packages

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

Self-Learning AI Summarization System

Overview

Features

System Components

Setup

Usage

Directory Structure

License

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Packages