Fully Vested
Many of the core technologies behind Generative AI are not exactly brand new. For example, the "Attention Is All You Need [https://arxiv.org/abs/1706.03762]" paper, which described and introduced the Transformer model (the "T" in ChatGPT), was published in 2017. Diffusion models—the backbone of image generation tools like StableDiffusion and DALL-e—were introduced in 2015 [https://arxiv.org/pdf/1503.03585.pdf] and were originally inspired by thermodynamic modeling techniques. Generative adversarial networks (GANs) were introduced [https://proceedings.neurips.cc/paper_files/paper/2014/file/5ca3e9b122f61f8f06494c97b1afccf3-Paper.pdf] in 2014. However, Generative AI has seemingly taken the world by storm over the past couple years. In this episode, Graham and Jason discuss—in broad strokes—what Generative AI is, what's required to train and run foundation models, where the value lies, and frontier challenges. FACT-CHECKING AND CORRECTIONS Before we begin... * At around 36:16 Jason said that the Pile was compiled by OpenAI or one of its research affiliates. This is not correct. The Pile was compiled by Eleuther.ai, and we couldn't find documentation suggesting that OpenAI incorporates the entirety of The Pile into its training data corpus. * At 49:07 Jason mentions "The Open Source Institute" but actually meant to mention the Open Source Initiative [https://opensource.org/] APPLIED MACHINE LEARNING 101 Not all AI and applied machine learning models are created equally, and models can be designed to complete specific types of tasks. Broadly speaking, there are two types of applied machine learning models: Discriminative and Generative. DISCRIMINATIVE AI Definition: Discriminative AI focuses on learning the boundary between different classes of data from a given set of training data. Unlike generative models that learn to generate data, discriminative models learn to differentiate between classes and make predictions or decisions based on the input data. Historical Background TLDR: * The development of Discriminative AI has its roots in statistical and machine learning approaches aimed at classification tasks. * Logistic regression and Support Vector Machines (SVMs) are early examples of discriminative models, which have been used for many years in various fields including computer vision and natural language processing. * Over time, with the development of deep learning, discriminative models like Convolutional Neural Networks (CNNs) and Recurrent Neural Networks (RNNs) have become highly effective for a wide range of classification tasks. Pop Culture Example(s): * "Hotdog vs. Not a Hotdog algorithm [https://www.youtube.com/watch?v=ACmydtFDTGs]" from HBO's Silicon Valley (S4E4) * Image recognition capabilities of something like Iron Man alter ego Tony Stark's JARVIS (2008) **Real-World Example(s * Automatic speech recognition (ASR) * Spam and abuse detection * Facial recognition, such as Apple's Face ID and more Orwellian examples in places ranging from China to England Further Reading: * Discriminative Model [https://en.wikipedia.org/wiki/Discriminative_model] (Wikipedia) GENERATIVE AI Definition: Generative AI refers to a type of artificial intelligence that is capable of generating new data samples that are similar to a given set of training data. This is achieved through algorithms that learn the underlying patterns, structures, and distributions inherent in the training data, and can generate novel data points with similar properties. Historical Background TLDR: * The origins of Generative AI can be traced back to the development of generative models, with early instances including probabilistic graphical models in the early 2000s. * However, the field truly began to gain traction with the advent of Generative Adversarial Networks (GANs) b y Ian Goodfellow and his colleagues in 2014. * Since then, various generative models like Variational Autoencoders (VAEs) and others have also gained prominence, contributing to the rapid advancement of Generative AI. Pop Culture Example: * The AI from the movie Her (2013) Real-World Example(s): * OpenAI's GPT family, alongside image models like StableDiffusion, and Midjourney. Further Reading: * Deepgram's Generative AI page [https://deepgram.com/ai-glossary/generative-ai] in the AI Glossary... co-written by Jason and GPT-4. * Large Language Model [https://deepgram.com/ai-glossary/large-language-model] in the Deepgram AI Glossary... also co-written by Jason and GPT-4. * The Physics Principle That Inspired Modern AI Art [https://www.quantamagazine.org/the-physics-principle-that-inspired-modern-ai-art-20230105/] (Anil Ananthaswamy, for Quanta Magazine) * Visualizing and Explaining Transformer Models From the Ground Up [https://deepgram.com/learn/visualizing-and-explaining-transformer-models-from-the-ground-up] (Zian "Andy" Wang for the Deepgram blog, January 2023) * Transformer Explained [https://paperswithcode.com/method/transformer] hub on PapersWithCode * Transformers, Explained: Understand the Model Behind GPT-3, BERT, and T5 [https://daleonai.com/transformers-explained] (Dale Markowitz on his blog, Dale on AI., May 2021) FURTHER READING BY TOPIC In rough order of when these topics were mentioned in the episode... ECONOMIC/INDUSTRY IMPACTS OF AI How Large Language Models Will Transform Science, Society, and AI [https://hai.stanford.edu/news/how-large-language-models-will-transform-science-society-and-ai] (Alex Tamkin and Deep Ganguli for Stanford HAI's blog, February 2021) The Economic Potential of Generative AI: The Next Productivity Frontier [https://www.mckinsey.com/capabilities/mckinsey-digital/our-insights/the-economic-potential-of-generative-ai-the-next-productivity-frontier] ( McKinsey & Co., June 2023) Generative AI Could Raise Global GDP by 7% [https://www.goldmansachs.com/intelligence/pages/generative-ai-could-raise-global-gdp-by-7-percent.html] (Goldman Sachs, April 2023) Generative AI Promises an Economic Revolution. Managing the Disruption Will Be Crucial. [https://www.wsj.com/articles/generative-ai-promises-an-economic-revolution-managing-the-disruption-will-be-crucial-b1c0f054] (Bob Fernandez for WSJ Pro Central Banking, August 2023) The Economic Case for Generative AI and Foundation Models [https://a16z.com/the-economic-case-for-generative-ai-and-foundation-models/] (Martin Casado and Sarah Wang for the Andreessen Horowitz Enterprise blog, August 2023) Generative AI and the software development lifecycle [https://www.thoughtworks.com/en-us/insights/articles/generative-ai-software-development-lifecycle-more-than-coding-assistance](Birgitta Böckeler and Ryan Murray for Thoughtworks, September 2023) How generative AI is changing the way developers work [https://github.blog/2023-04-14-how-generative-ai-is-changing-the-way-developers-work/] (Damian Brady for The GitHub Blog, April 2023) The AI Business Defensibility Problem [https://datastream.substack.com/p/the-ai-business-defensibility-problem] (Jay F. publishing on their Substack, The Data Stream) USING LANGUAGE MODELS EFFECTIVELY The emerging types of language models and why they matter [https://techcrunch.com/2022/04/28/the-emerging-types-of-language-models-and-why-they-matter/?guccounter=1] (Kyle Wiggers for TechCrunch, April 2023) * * Crafting AI Commands: The Art of Prompt Engineering [https://deepgram.com/learn/what-is-prompt-engineering] (Nithanth Ram for the Deepgram blog, March 2023) Prompt Engineering [https://lilianweng.github.io/posts/2023-03-15-prompt-engineering/] (Lilian Weng on her blog Lil'Log, March 2023) Prompt Engineering Techniques: Chain-of-Thought [https://deepgram.com/learn/chain-of-thought-prompting-guide] & Tree-of-Thought [https://deepgram.com/learn/tree-of-thoughts-prompting] (both by Brad Nikkel for the Deepgram blog) 11 Tips to Take Your ChatGPT Prompts to the Next Level [https://www.wired.com/story/11-tips-better-chatgpt-prompts/] (David Nield for WIRED, March 2023) Prompt Engineering 101 [https://humanloop.com/blog/prompt-engineering-101] (Raza Habib and Sinan Ozdemir for the Humanloop blog, December 2022) HERE THERE BE DRAGONS Hallucinations * Hallucination (artificial intelligence) [https://en.wikipedia.org/wiki/Hallucination_%28artificial_intelligence%29] (Wikipedia) * Chatbot Hallucinations Are Poisoning Web Search [https://www.wired.com/story/fast-forward-chatbot-hallucinations-are-poisoning-web-search/] (Will Knight for WIRED, October 2023) * How data poisoning attacks corrupt machine learning models [https://www.csoonline.com/article/570555/how-data-poisoning-attacks-corrupt-machine-learning-models.html] (Lucian Constantin for CSO Online) Data Poisoning & Related * Data Poisoning [https://paperswithcode.com/task/data-poisoning] hub on PapersWithCode * Glaze - Protecting Artists from Generative AI [https://glaze.cs.uchicago.edu/] project from UChicago (2023) * Self-Consuming Generative Models Go MAD [https://arxiv.org/abs/2307.01850] (Alemohammad et al. on ArXiv, July 2023) * What Happens When AI Eats Itself [https://deepgram.com/learn/when-ai-eats-itself] (Tife Sanusi for the Deepgram blog, August 2023) * The AI is eating itself [https://www.platformer.news/p/the-ai-is-eating-itself] (Casey Newton for Platformer, June 2023) * AI-Generated Data Can Poison Future AI Models [https://www.scientificamerican.com/article/ai-generated-data-can-poison-future-ai-models/] (Rahul Rao for Scientific American, July 2023) Intellectual Property and Fair Use * Measuring Fair Use: The Four Factors - Copyright Overview [https://fairuse.stanford.edu/overview/fair-use/four-factors/] (Rich Stim for the Stanford Copyright and Fair Use Center) * Is the Use of Copyrighted Works to Train AI Qualified as a Fair Use [https://copyrightalliance.org/copyrighted-works-training-ai-fair-use/] (Cala Coffman for the Copyright Alliance blog, April 2023) * Reexamining "Fair Use" in the Age of AI [https://hai.stanford.edu/news/reexamining-fair-use-age-ai] (Andrew Myers for Stanford HAI) * Copyright Fair Use Regulatory Approaches in AI Content Generation [https://techpolicy.press/copyright-fair-use-regulatory-approaches-in-ai-content-generation/] (Ariel Soiffer and Aric Jain for Tech Policy Press, August 2023) * Japan's AI Data Laws, Explained [https://www.deeplearning.ai/the-batch/japan-ai-data-laws-explained/] (Deeplearning.ai) * PDF: Generative Artificial Intelligence and Copyright Law [https://crsreports.congress.gov/product/pdf/LSB/LSB10922] (Congressional Research Center, September 2023) Academic and Creative "Honesty" * How it started. New AI classifier for indicating AI-written text [https://openai.com/blog/new-ai-classifier-for-indicating-ai-written-text] (Kirchner et al., January 2023) * How it's going. OpenAI Quietly Shuts Down Its AI Detection Tool [https://decrypt.co/149826/openai-quietly-shutters-its-ai-detection-tool] (Jason Nelson for Decrypt) * AI Homework [https://stratechery.com/2022/ai-homework/] (Ben Thompson on Stratechery, December 2022) * Teaching With AI [https://openai.com/blog/teaching-with-ai] (OpenAI, August 2023) Human Costs of AI Training (Picking on OpenAI here, but RLHF and similar fine-tuning techniques are employed by many/most LLM developers) * Cleaning Up ChatGPT Takes Heavy Toll on Human Workers [https://www.wsj.com/articles/chatgpt-openai-content-abusive-sexually-explicit-harassment-kenya-workers-on-human-workers-cf191483] (Karen Hao and Deepa Seetharaman for the Wall Street Journal) * ‘It’s destroyed me completely’: Kenyan moderators decry toll of training of AI models [https://www.theguardian.com/technology/2023/aug/02/ai-chatbot-training-human-toll-content-moderator-meta-openai] (Niamh Rowe in The Guardian, August 2023) * He Helped Train ChatGPT. It Traumatized Him. [https://www.bigtechnology.com/p/he-helped-train-chatgpt-it-traumatized] (Alex Kantrowitz in his publication Big Technology, May 2023) * https://www.nytimes.com/2023/09/25/technology/chatgpt-rlhf-human-tutors.html [https://www.nytimes.com/2023/09/25/technology/chatgpt-rlhf-human-tutors.html] Big Questions * Open questions for AI engineering [https://simonwillison.net/2023/Oct/17/open-questions/] (Simon Willison, October 2023) ADAM SMITH AND THE PIN FACTORY 📚 An Inquiry into the Nature and Causes of the Wealth of Nations by Adam Smith [https://www.gutenberg.org/ebooks/3300] (via Project Gutenberg) Division of Labor and Specialization [https://www.econlib.org/library/topics/highschool/divisionoflaborspecialization.html] (Econlib) Adam Smith and the Pin Factory [https://www.johnkay.com/2019/12/18/adam-smith-and-the-pin-factory/] (John Kay on his blog, September 2019) The Pin Factory [https://www.adamsmithworks.org/pin_factory.html] (Adam Smith Works, a project of the Liberty Fund) Adam Smith and Pin-making: Some Inconvenient Truths [https://conversableeconomist.com/2022/08/23/adam-smith-and-pin-making-some-inconvenient-truths/] (Timothy Taylor publishing on his blog, Conversable Economist. August 2022) ALSO MENTIONED TRAINING DATASETS The Pile [https://pile.eleuther.ai/] ImageNet [https://www.image-net.org/] ARTICLES AND BOOKS (Alleged) Leaked Google memo mentioned at around 48:05 Google "We Have No Moat, And Neither Does OpenAI [https://www.semianalysis.com/p/google-we-have-no-moat-and-neither] (SemiAnalysis publishing a whitepaper, allegedly written by a Google staff member, which suggested that open source advancements pose an existential threat to both Google and OpenAI. May 2023) Infinite Jest [https://en.wikipedia.org/wiki/Infinite_Jest] (Wikipedia page on the 1996 novel by the late David Foster Wallace), mentioned around 54:30 HISTORY OF FAXES VS. EMAILS In reference to 1:08:30 or thereabouts: A fun fact which blew our mind: The earliest instances of facsimile transmission stretch back to the 1890s, and the core technology matured up through the 1940s. The first telephonic fax patent was filed in 1964 by Xerox Corporation. The first email was sent in 1971.
45 episoder
Kommentarer
0Vær den første til at kommentere
Tilmeld dig nu og bliv en del af Fully Vested-fællesskabet!