Priyanka Kamila priyankak17

👋 Hi, I'm Priyanka Kamila

Machine Learning Engineer & Applied Researcher

Multimodal Generative Systems • Video Understanding • World Models

🚀 What I'm Building

Building next-generation generative AI systems that understand and create multimodal content from text to motion to photorealistic video.

🎬 Generative Video Research

🔹 Transformer-based architectures for text→video synthesis
🔹 Diffusion models for temporal consistency and coherence
🔹 Long-range dependency modeling in video sequences
🔹 Cross-modal fusion for unified multimodal representations

⚙️ Production ML Systems

🔸 Distributed training pipelines (PyTorch DDP)
🔸 ONNX & TensorRT deployment optimization
🔸 Large-scale dataset engineering and preprocessing
🔸 Model quantization and inference acceleration

🔬 Core Research Interests

┌─ Generative Video Systems
│  ├─ 🧠 Transformer-based architectures
│  ├─ 🎨 Diffusion model research
│  ├─ ⏰ Temporal consistency modeling
│  └─ 🎥 Text-to-motion-to-video synthesis
│
├─ World Models
│  ├─ 🌍 Environment dynamics learning
│  ├─ 🎮 Predictive modeling
│  └─ 🔮 Temporal coherence
│
└─ Multimodal AI
├─ 🔄 Cross-modal fusion
├─ 👁️ Vision-language models
└─ 🎵 Audio-visual learning

⚙️ Engineering Focus

┌─ Distributed Training
│  ├─ ⚡ PyTorch DDP (8-GPU clusters)
│  ├─ 📊 Large-scale dataset pipelines
│  ├─ 🔬 Systematic experimentation
│  └─ 📈 Mixed precision training
│
├─ Production ML
│  ├─ 🚀 ONNX deployment
│  ├─ ⚡ TensorRT optimization
│  ├─ 📦 Model quantization
│  └─ 🎯 Inference acceleration
│
└─ Research Infrastructure
├─ 🔧 Experiment tracking
├─ 📉 Metric dashboards
└─ 🛠️ Reproducible pipelines

🎯 Technical Arsenal

Deep Learning & Generative AI

Computer Vision & Video

Production & Optimization

📊 Technical Depth Matrix

Domain	Technologies	Experience Level
🧠 Generative Modeling	Diffusion • GANs • VAEs • Transformers • Temporal Consistency	████████████ 95%
🎬 Video & Motion	Pose Estimation • Temporal Modeling • Sequence Alignment • Motion Synthesis	███████████░ 90%
🔄 Multimodal Systems	Cross-modal Fusion • Vision-Language • Audio-Visual Learning	██████████░░ 85%
⚡ Distributed Training	PyTorch DDP • 8-GPU Clusters • Large-scale Pipelines • Experiment Tracking	███████████░ 88%
🚀 Model Deployment	ONNX • TensorRT • Quantization • Inference Optimization	██████████░░ 82%

🏆 Notable Achievements

🎓 Academic

MSc Computer Vision,
Robotics & ML
University of Surrey

📄 Dissertation: Cross-modal latent fusion for multimodal face generation

📚 Publications

2 Research Papers

✨ Generative AI Architectures
(AI Accelerator Institute, 2024)

🔬 Computer Vision Systems
(IJESC, 2019)

🏅 Hackathon

AI Encode London Winner

⚡ Built a real-time AI prototype in 48 hours

🚀 Production-ready system demo

💡 Featured Projects

🎭 Cross-modal Face Generation

MSc Dissertation: Latent fusion for multimodal synthesis
Tech: VAEs • Cross-modal Fusion • PyTorch
Status: Published Research

🤝 Open to Collaboration

I'm actively seeking opportunities to collaborate on cutting-edge research problems:

Research Area	Specific Interests
🌍 World Models	Environment dynamics • Predictive modeling • Spatial reasoning
🎬 Video Generation	Temporal coherence • Motion synthesis • Controllable generation
🔄 Multimodal AI	Cross-modal fusion • Unified representations • Audio-visual learning
⏰ Temporal Reasoning	Long-range dependencies • Sequence modeling • Event prediction
🎮 Interactive Systems	Real-time generation • Human-AI collaboration • Embodied AI

📍 Location & Contact

📍 London, United Kingdom
🎓 MSc Computer Vision, Robotics & ML @ University of Surrey
🔬 Researching at the intersection of generative AI, video understanding, and world models

Let's Connect!

⚡ Building the future of generative AI, one model at a time ⚡

_{Specializing in multimodal systems, video generation, and scalable ML infrastructure}

Provide feedback

Saved searches

Use saved searches to filter your results more quickly