Skip to content

mikesdatawork/ai-ml-datasets-hub

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

5 Commits
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

DataFlux: AI/ML Dataset Nexus

A comprehensive collection of high-quality datasets specifically curated for artificial intelligence and machine learning workflows.

🔍 Overview

This repository provides a structured catalog of public datasets ideal for:

  • Training and fine-tuning machine learning models
  • Building data pipelines and ETL processes
  • Benchmarking algorithm performance
  • Practicing data cleaning and preprocessing techniques
  • Supporting research in various AI domains

📊 Dataset Categories

Browse our dataset collections by type:

🛠️ Data Pipeline Resources

Data Processing Guides

Example Notebooks

📚 General Dataset Resources

For a complete list of dataset sources, see our original dataset catalog.

🤝 Contributing

We welcome contributions! Please see our Contributing Guidelines for details on how to add datasets or examples.

📄 License

This catalog is available under the MIT License. Individual datasets may have their own licenses.