HAMLET is a holistic and automated framework for evaluating the long-context comprehension of large language models (LLMs).
# Navigate to project directory
cd HAMLET
# Install dependencies
pip install -r requirements.txt# Edit configuration with your API keys
vim config/config.yaml# Process a single book
python scripts/run_pipeline.py \
--input data/books/your_novel.txt \
--config config/config.yaml- Text Chunking: Split novels into manageable 4000-token chunks
- Tree Extraction: Extract hierarchical key facts (narrative & analytical)
- Tree Validation: Validate facts across three dimensions
- Query Generation: Generate targeted queries based on validated trees
- Summary Generation: Create summaries using multiple language models
- Summary Evaluation: Evaluate faithfulness, completeness, and conciseness
- Evaluation Processing: Advanced analysis of evaluation results
- Metrics Analysis: Generate position-based and comparative metrics