Skip to content
View priyankak17's full-sized avatar
๐ŸŽฏ
Focusing
๐ŸŽฏ
Focusing

Block or report priyankak17

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please donโ€™t include any personal information such as legal names or email addresses. Markdown is supported. This note will only be visible to you.
Report abuse

Contact GitHub support about this userโ€™s behavior. Learn more about reporting abuse.

Report abuse
priyankak17/README.md

๐Ÿ‘‹ Hi, I'm Priyanka Kamila

Machine Learning Engineer & Applied Researcher

Multimodal Generative Systems โ€ข Video Understanding โ€ข World Models

Portfolio LinkedIn Email


๐Ÿš€ What I'm Building

Building next-generation generative AI systems that understand and create multimodal content from text to motion to photorealistic video.

๐ŸŽฌ Generative Video Research

๐Ÿ”น Transformer-based architectures for textโ†’video synthesis
๐Ÿ”น Diffusion models for temporal consistency and coherence
๐Ÿ”น Long-range dependency modeling in video sequences
๐Ÿ”น Cross-modal fusion for unified multimodal representations

โš™๏ธ Production ML Systems

๐Ÿ”ธ Distributed training pipelines (PyTorch DDP)
๐Ÿ”ธ ONNX & TensorRT deployment optimization
๐Ÿ”ธ Large-scale dataset engineering and preprocessing
๐Ÿ”ธ Model quantization and inference acceleration


๐Ÿ”ฌ Core Research Interests

โ”Œโ”€ Generative Video Systems
โ”‚  โ”œโ”€ ๐Ÿง  Transformer-based architectures
โ”‚  โ”œโ”€ ๐ŸŽจ Diffusion model research
โ”‚  โ”œโ”€ โฐ Temporal consistency modeling
โ”‚  โ””โ”€ ๐ŸŽฅ Text-to-motion-to-video synthesis
โ”‚
โ”œโ”€ World Models
โ”‚  โ”œโ”€ ๐ŸŒ Environment dynamics learning
โ”‚  โ”œโ”€ ๐ŸŽฎ Predictive modeling
โ”‚  โ””โ”€ ๐Ÿ”ฎ Temporal coherence
โ”‚
โ””โ”€ Multimodal AI
โ”œโ”€ ๐Ÿ”„ Cross-modal fusion
โ”œโ”€ ๐Ÿ‘๏ธ Vision-language models
โ””โ”€ ๐ŸŽต Audio-visual learning

โš™๏ธ Engineering Focus

โ”Œโ”€ Distributed Training
โ”‚  โ”œโ”€ โšก PyTorch DDP (8-GPU clusters)
โ”‚  โ”œโ”€ ๐Ÿ“Š Large-scale dataset pipelines
โ”‚  โ”œโ”€ ๐Ÿ”ฌ Systematic experimentation
โ”‚  โ””โ”€ ๐Ÿ“ˆ Mixed precision training
โ”‚
โ”œโ”€ Production ML
โ”‚  โ”œโ”€ ๐Ÿš€ ONNX deployment
โ”‚  โ”œโ”€ โšก TensorRT optimization
โ”‚  โ”œโ”€ ๐Ÿ“ฆ Model quantization
โ”‚  โ””โ”€ ๐ŸŽฏ Inference acceleration
โ”‚
โ””โ”€ Research Infrastructure
โ”œโ”€ ๐Ÿ”ง Experiment tracking
โ”œโ”€ ๐Ÿ“‰ Metric dashboards
โ””โ”€ ๐Ÿ› ๏ธ Reproducible pipelines

๐ŸŽฏ Technical Arsenal

Deep Learning & Generative AI

PyTorch TensorFlow Hugging Face Diffusion Models GANs Transformers

Computer Vision & Video

OpenCV CUDA Pose Estimation Temporal Modeling

Production & Optimization

ONNX TensorRT Docker Mixed Precision


๐Ÿ“Š Technical Depth Matrix

Domain Technologies Experience Level
๐Ÿง  Generative Modeling Diffusion โ€ข GANs โ€ข VAEs โ€ข Transformers โ€ข Temporal Consistency โ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆ 95%
๐ŸŽฌ Video & Motion Pose Estimation โ€ข Temporal Modeling โ€ข Sequence Alignment โ€ข Motion Synthesis โ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–‘ 90%
๐Ÿ”„ Multimodal Systems Cross-modal Fusion โ€ข Vision-Language โ€ข Audio-Visual Learning โ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–‘โ–‘ 85%
โšก Distributed Training PyTorch DDP โ€ข 8-GPU Clusters โ€ข Large-scale Pipelines โ€ข Experiment Tracking โ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–‘ 88%
๐Ÿš€ Model Deployment ONNX โ€ข TensorRT โ€ข Quantization โ€ข Inference Optimization โ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–‘โ–‘ 82%

๐Ÿ† Notable Achievements

๐ŸŽ“ Academic

MSc Computer Vision,
Robotics & ML

University of Surrey

๐Ÿ“„ Dissertation: Cross-modal latent fusion for multimodal face generation

๐Ÿ“š Publications

2 Research Papers

โœจ Generative AI Architectures
(AI Accelerator Institute, 2024)

๐Ÿ”ฌ Computer Vision Systems
(IJESC, 2019)

๐Ÿ… Hackathon

AI Encode London Winner

โšก Built a real-time AI prototype in 48 hours

๐Ÿš€ Production-ready system demo


๐Ÿ’ก Featured Projects

๐ŸŽญ Cross-modal Face Generation

MSc Dissertation: Latent fusion for multimodal synthesis
Tech: VAEs โ€ข Cross-modal Fusion โ€ข PyTorch
Status: Published Research



๐Ÿค Open to Collaboration

I'm actively seeking opportunities to collaborate on cutting-edge research problems:

Research Area Specific Interests
๐ŸŒ World Models Environment dynamics โ€ข Predictive modeling โ€ข Spatial reasoning
๐ŸŽฌ Video Generation Temporal coherence โ€ข Motion synthesis โ€ข Controllable generation
๐Ÿ”„ Multimodal AI Cross-modal fusion โ€ข Unified representations โ€ข Audio-visual learning
โฐ Temporal Reasoning Long-range dependencies โ€ข Sequence modeling โ€ข Event prediction
๐ŸŽฎ Interactive Systems Real-time generation โ€ข Human-AI collaboration โ€ข Embodied AI

๐Ÿ“ Location & Contact

๐Ÿ“ London, United Kingdom
๐ŸŽ“ MSc Computer Vision, Robotics & ML @ University of Surrey
๐Ÿ”ฌ Researching at the intersection of generative AI, video understanding, and world models

Let's Connect!

Portfolio LinkedIn Email


โšก Building the future of generative AI, one model at a time โšก

Specializing in multimodal systems, video generation, and scalable ML infrastructure

Pinned Loading

  1. cross-modal-latent-fusion cross-modal-latent-fusion Public

    Cross-modal latent alignment and fusion for sketch + RGB face reconstruction using StyleGAN (MSc Dissertation, University of Surrey โ€“ CVRML).

    Python

  2. Gender-Classification-model-using-Support-Vector-Machines Gender-Classification-model-using-Support-Vector-Machines Public

    Gender Classification model using Support Vector Machines, in python Open-CV, using PCA Eigen-faces

    Jupyter Notebook

  3. inditex_mvp_architecture inditex_mvp_architecture Public

    HTML

  4. SIFT_Detector SIFT_Detector Public

    Python Code to detect SIFT keypoints in a given image

    Jupyter Notebook

  5. feemthan/aml_coursework feemthan/aml_coursework Public

    Jupyter Notebook