GitHub - jrzmnt/jrzmnt

🔭 I'm currently working on RL + Language Models for decision making under uncertainty
🌱 I'm currently learning alignment techniques (DPO, preference optimization, evaluators)
👯 I'm looking to collaborate on research projects involving RL and LLMs
🤝 I'm looking for help with academic collaboration and research discussions
💬 Ask me about reinforcement learning, LLM evaluation and uncertainty, agents and tool-using models, and experiment design and reproducibility
📫 How to reach me: open an issue in any repo or reach out via GitHub Discussions
👨‍💻 All of my projects are available at https://jrzmnt.github.io/
📄 Learn about my experience at https://www.linkedin.com/in/juarez-monteiro/

.github/workflows
profile-3d-contrib
README.md
banner.png
dblp.svg
homepage.svg
in.svg
instagram.svg
lattes.svg
researchgate.svg
scholar.svg
twitter.svg
