Skip to content
View dzungphieuluuky's full-sized avatar
πŸ’­
Hallucinating...
πŸ’­
Hallucinating...

Highlights

  • Pro

Block or report dzungphieuluuky

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don’t include any personal information such as legal names or email addresses. Markdown is supported. This note will only be visible to you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
dzungphieuluuky/README.md

🌟 dzungphieuluuky

Deep Learning | Reinforcement Learning | Diffusion Modeling

Undergraduate student passionate about advancing generative AI through the intersection of reinforcement learning and diffusion models. I'm also interested in how representation learning can be improved through intelligent neural architecture design.


🎯 About Me

My interest includes but is not limited at:

  • Diffusion Models: Interested in this architecture recently due to its inspiration from physics, specifically thermodynamics.
  • Reinforcement Learning: Exploring exploration-exploitation trade-off and designing more approaches to leverage experience better for decision-making.
  • Representation Learning: Designing novel architectures with advanced learning capabilities for high-dimensional data distributions.

πŸ› οΈ Technical Toolkit

Languages

Python, C++, German (Duolingo)

AI/ML Frameworks

  • Core: PyTorch, TensorFlow, transformers
  • Specialization: diffusers, Ray RLlib, Stable Baselines3
  • Mathematics and MlOps: NumPy, SciPy, Pandas, Scikit-learn, wandb

Portfolio β†’


πŸ’‘ What I'm Learning

  • deep-ml.com: Solving daily quests to avoid corruption from vibe coding
  • German Language: Duolingo, learning basic vocabulary
  • Diffusion modeling: How diffusion works, actually?
  • Deep reinforcement learning: Reference Reinforcement Learning: An Introduction by Sutton & Barto

πŸ“« Connect With Me

I love discussing deep learning stuff and its mysteries:

Platform URL
🌐 Website dzungphieuluuky
πŸ“§ Email Work Email
πŸ’Ό LinkedIn Dung Pham

Pinned Loading

  1. RushHour RushHour Public

    Benchmark and Comparison on several searching algorithms including uninformed search and informed search. Visualization is presented through Rush Hour game. This is the first project in the course …

    Python 1

  2. WumpusWorld WumpusWorld Public

    Knowledge-based AI Agent with reasoning built with knowledge-based full resolution. Visualization via Wumpus World game.

    Python 1

  3. Finetune-DeepSeekOCR Finetune-DeepSeekOCR Public

    Finetuning DeepSeek-OCR on a small dataset with unsloth notebook. Assignment 2 Introduction to Natural Language Processing HCMUS.

    Jupyter Notebook 1

  4. EnergyRL EnergyRL Public

    Deep Reinforcement Learning for training an agent to manage and control power output in a network cells with Soft Actor Critic algorithm.

    Python 1

  5. OuroTrace OuroTrace Public

    Benchmark and evaluation ByteDance Ouro model based on Looped Language Models on several reasoning tasks.

    Python 1