Skip to content

Dishant27/Dishant_Portfolio

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

15 Commits
 
 
 
 
 
 

Repository files navigation

📊 Dishant Kumar - Data Science & AI Portfolio with Focus on Cutting-Edge Technologies

Data Science Machine Learning AI

A collection of my data science projects showcasing skills in machine learning, deep learning, NLP, and more.

🚀 About This Portfolio

This repository serves as a comprehensive showcase of my data science and machine learning projects. Each project demonstrates different skills and techniques in the field of data science, from exploratory data analysis to model deployment. I'm passionate about turning data into actionable insights and building intelligent systems that solve real-world problems.

📚 Featured Projects

Football Analysis

Overview: A data-driven approach to football player analysis, team building, and performance prediction using machine learning.

Key Features:

  • Data-driven player evaluation across key performance metrics
  • Squad optimization tools based on budget, playing style, and formation
  • Performance prediction models for player potential
  • Interactive visualization of player statistics
  • Budget allocation strategies for maximizing team efficiency

Technologies Used: Python, pandas, scikit-learn, matplotlib, seaborn

Impact: Provides football clubs and scouts with an objective, systematic analysis of player performance, helping optimize recruitment strategy and budget allocation.

Age vs Potential


Overview: An end-to-end data science pipeline to predict data scientist salaries based on job postings from Glassdoor.

Key Features:

  • Automated web scraping of Glassdoor job listings using Selenium
  • Text analysis of job descriptions using NLP techniques
  • Feature engineering from raw job posting data
  • Multiple regression models to predict salary ranges
  • Interactive visualization of salary trends across locations and skills

Technologies Used:

  • Python, Selenium, BeautifulSoup
  • pandas, NumPy for data processing
  • scikit-learn for machine learning
  • matplotlib, seaborn for visualization
  • Flask for deployment

Project Status:

  • ✅ Implemented Glassdoor scraper
  • ✅ Collected initial dataset
  • 🔄 Data cleaning and preprocessing in progress
  • 🔄 Feature engineering in development

Overview: An end-to-end machine learning application predicting diabetes risk based on diagnostic measurements, deployed as a web application.

Key Features:

  • Analysis of Pima Indians Diabetes Database
  • Comprehensive data exploration and visualization
  • Implementation of multiple machine learning algorithms
  • Web application with Flask
  • Active deployment on Heroku

Technologies Used: Python, pandas, scikit-learn, Flask, Heroku

Live Demo: Diabetes Predictor App


Overview: NLP project detecting hate speech and offensive language in Twitter posts with high accuracy.

Key Features:

  • Binary classification model for tweet sentiment
  • Advanced NLP preprocessing for social media text
  • Comparison of multiple classification algorithms
  • Achieved 95% accuracy with logistic regression

Technologies Used: Python, NLTK, scikit-learn, pandas


Overview: Deep learning project implementing CNNs for image classification using TensorFlow and Keras.

Key Features:

  • Convolutional Neural Network architecture
  • Data augmentation techniques
  • Transfer learning approaches
  • Interactive visualization of model performance

Technologies Used: Python, TensorFlow, Keras, numpy, matplotlib


Overview: Intelligent conversational agent built with Python, NLTK, and TensorFlow for natural language understanding.

Key Features:

  • Natural language understanding capabilities
  • Intent classification with neural networks
  • Response generation from knowledge base
  • Extensible architecture for new intents

Technologies Used: Python, NLTK, TensorFlow, Keras

📈 Skills Demonstrated

This portfolio showcases my proficiency in:

  • Data Analysis & Visualization: pandas, NumPy, matplotlib, seaborn
  • Machine Learning: scikit-learn, regression, classification, clustering
  • Deep Learning: TensorFlow, Keras, CNNs
  • Natural Language Processing: NLTK, text classification, sentiment analysis
  • Web Development: Flask, API development
  • Deployment: Heroku, web applications
  • Data Collection: Web scraping, API integration
  • Programming: Python, object-oriented design

💻 Technical Expertise

Languages & Frameworks

  • Python: Advanced proficiency with 5+ years experience
  • SQL: Expert in database queries and optimization
  • R: Working knowledge for statistical analysis
  • JavaScript: Basic proficiency for web visualization
  • Flask/Django: Web application development
  • TensorFlow/PyTorch: Deep learning model implementation

Data Engineering

  • ETL Pipeline Design: Automated data collection and transformation workflows
  • Data Warehousing: Schema design and optimization for analytical queries
  • Database Systems: MySQL, PostgreSQL, MongoDB
  • Big Data Technologies: Spark, Hadoop ecosystem
  • API Development: RESTful services for data access and model serving

Machine Learning Operations

  • Model Versioning: Tracking experiments and model iterations
  • Automated Testing: Validation frameworks for model performance
  • CI/CD for ML: Continuous integration and deployment pipelines
  • Model Monitoring: Drift detection and performance tracking
  • Containerization: Docker for reproducible environments

🔮 Future Projects

I'm constantly working on new projects. Here are some areas I'm exploring for future work:

  • Recommendation systems
  • Time series forecasting
  • Computer vision applications
  • Reinforcement learning
  • Big data processing with Spark
  • MLOps and model deployment pipelines

📧 Contact Information

Feel free to reach out if you'd like to discuss any of these projects or potential collaborations:


⭐ Thank you for visiting my data science portfolio! ⭐

About

My Data Science portfolio

Topics

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

 
 
 

Contributors