Mindful Machines

Alex Spies - Structured Representations in Maze-Solving Transformers

59 min · 1. kesä 2024
jakson Alex Spies - Structured Representations in Maze-Solving Transformers kansikuva

Kuvaus

In this discussion, Alex Spies will provide an overview of mechanistic interpretability tools and the approaches researchers employ to "reverse engineer" transformer models. He will then explain how his team used some of these techniques to uncover emergent structures in the models they trained and how these structures may facilitate a systematic understanding of internal search processes. What guarantees can Mechanistic Interpretability provide for logic-based programs (if any)? Alex is a PhD student at Imperial College London who is currently in Tokyo as a Research Fellow at the National Institute of Informatics (NII).

Kommentit

0

Ole ensimmäinen kommentoija

Rekisteröidy nyt ja liity Mindful Machines-yhteisöön!

Aloita maksutta

14 vrk ilmainen kokeilu

Kokeilun jälkeen 7,99 € / kuukausi. · Peru milloin tahansa.

  • Podimon podcastit
  • 20 kuunteluaikaa / kuukausi
  • Lataa offline-käyttöön

Kaikki jaksot

3 jaksot