- 🔭 I'm currently working on RL + Language Models for decision making under uncertainty
- 🌱 I'm currently learning alignment techniques (DPO, preference optimization, evaluators)
- 👯 I'm looking to collaborate on research projects involving RL and LLMs
- 🤝 I'm looking for help with academic collaboration and research discussions
- 💬 Ask me about Reinforcement Learning, LLM evaluation & uncertainty, agents and tool-using models, experiment design and reproducibility
- 📫 How to reach me: open an issue in any repo or reach out via GitHub Discussions
- 👨‍💻 All of my projects are available at https://jrzmnt.github.io/
- 📄 Know about my experiences: https://www.linkedin.com/in/juarez-monteiro/
CS PhD | AI Researcher | Data Scientist
- Kunumi
- Brazil
- (UTC -03:00)
- https://jrzmnt.github.io/
- https://orcid.org/0000-0002-8831-5343
- in/juarez-monteiro
Pinned
- lung-disease-classification (Public): Lung Disease Classification
  Jupyter Notebook 3
- ActionRecognitionSmallDatasets (Public): Evaluating the Feasibility of Deep Learning for Action Recognition in Small Datasets
- NathanGavenski/ABCO (Public): Official Pytorch implementation of Augmented Behavior Cloning from Observation
- NathanGavenski/IUPE (Public): Pytorch official implementation for Imitating Unknown Policies via Exploration.



