Juarez Monteiro jrzmnt

🔭 I'm currently working on RL + Language Models for decision making under uncertainty
🌱 I'm currently learning Alignment techniques (DPO, preference optimization, evaluators)
👯 I'm looking to collaborate on Research projects involving RL and LLMs
🤝 I'm looking for help with Academic collaboration and research discussions
💬 Ask me about Reinforcement Learning, LLM evaluation and uncertainty, agents and tool-using models, and experiment design and reproducibility
📫 How to reach me: open an issue in any repo or reach out via GitHub Discussions
👨‍💻 All of my projects are available at https://jrzmnt.github.io/
📄 Learn about my experience at https://www.linkedin.com/in/juarez-monteiro/
