Skip to content

jonathanbaraldi/jonathanbaraldi

Folders and files

NameName
Last commit message
Last commit date

Latest commit

ย 

History

3 Commits
ย 
ย 
ย 
ย 
ย 
ย 

Repository files navigation

Research Focus

I am a Research Engineer bridging the gap between theoretical AI safety and scalable infrastructure. My work focuses on Knowledge Provenanceโ€”tracing the specific training data responsible for model behaviors during inference.

I build "Glass Box" tooling to make Large Language Models auditable by design.

๐Ÿ”ฌ Current Research & Methodologies

  • Knowledge Provenance Maps: A framework for tracking gradient influence at the document level during the training loop.
  • Mechanistic Interpretability: Developing 3D visualization tools to map "knowledge anatomy" across transformer layers.
  • Model Health Metrics: Author of the Sparsity, Concentration, and Utilization metrics for diagnosing fine-tuning efficacy.

๐Ÿ“œ Selected Publications & Pre-prints

๐Ÿ› ๏ธ Open Source Engineering

  • knowledge-provenance-suite: A PyTorch-based library for tracking training data influence and visualizing 3D knowledge maps. (Formerly "Transparent AI Suite").
  • granular-gradient-tracker: Memory-efficient implementation of per-sample gradient tracking for LLMs.

๐Ÿ“ซ Contact

  • Collaboration: I am open to research collaborations on interpretability and model steering.
  • Email: [email protected]

About

Presentation repo

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

 
 
 

Contributors