Research
My research interests are in deep learning models, computation, and systems. I currently work on code generation and reasoning and lead FAIR's codegen research in North America, supporting a team of scientists and engineers across Menlo Park, New York, and Paris.
Some active and past areas of research include:
- Reasoning and code generation: co-leading Code World Model (CWM)
- Multi-modal generative models: co-leading Chameleon and building Transfusion
- Speech recognition: leading Libri-Light and building wav2letter
As part of my research and towards scaling large deep learning systems, I often build open source software; I led teams that created Flashlight, Shumai, and wav2letter, parts of which are now in PyTorch.
Selected Invited Talks
- Trends in Deep Learning Computation and Frameworks — Intel (2024)
- Multimodal Generative Models — Charles River Ventures & Greylock (2024)
- GPU Computing at Scale — University of Pennsylvania, CIS 565: GPU Programming and Architecture (2024)
- Shumai: Fast, Flexible Machine Learning in TypeScript — Google DeepMind (2023)
- Scaling Deep Learning for Automatic Speech Recognition — NVIDIA GPU Technology Conference (2020)
Background
I'm an AI researcher and engineer at FAIR, Meta AI.
I'm also on the computer science faculty at the University of Pennsylvania.
I completed undergraduate and graduate studies via the M&T Program at the University of Pennsylvania. I studied computer science at Penn Engineering and statistics/operations research at The Wharton School.
While at Penn, I ran PennApps, the largest collegiate hackathon in the world at the time.
I grew up outside of Chicago.
Teaching
I created and teach CIS 5690: GPU Programming for Machine Learning Systems at Penn.
I was a teaching assistant for several computer science courses while studying at Penn, including:
- CIS 380, 548, 700: Operating Systems (Spring/Fall 2017, 2018) - Head TA - Concurrency, resource management, virtual memory, file systems, virtual machines.
- CIS 505: Distributed Systems (Spring 2018) - Synchronization, communication, replication. Project-based, in C++.
- CIS 320: Algorithms (Spring/Fall 2017, 2018) - Head TA - Combinatorial optimization, sorting, hashing, graphs, complexity theory.
- CIS 240, 593: Introduction to Computer Architecture (Fall 2015, 2016) - Hardware structures, organization, machine language, C.
- CIS 550: Database and Information Systems (Spring 2017) - Structured data, modeling, architecture, and distributed processing.
