I'm working to make machines more helpful through unsupervised learning that scales.
At OpenAI, I developed the Sparse Transformer with collaborators and coauthored work showing the emergent capabilities of large language models in a variety of settings (GPT-2, GPT-3, Image GPT, and more). I also worked on reducing the limitations of those techniques (very deep VAEs) and collaborated on applying them on larger supercomputers (MT-NLG and PaLM).
More recently, I was on the founding team at Inflection AI. We were hired by Microsoft to form a new division of the company called Microsoft AI.
I am half-Japanese, and my Japanese name is 石井興元. よろしくお願いします (pleased to meet you). I live in Palo Alto, California.