Coverbild der Sendung Inside MySQL: Sakila Speaks

Inside MySQL: Sakila Speaks

Podcast von Oracle Corporation

Englisch

Wissen​schaft & Techno​logie

Loslegen

Dann 4,99 € / Monat. Jederzeit kündbar.

  • 20 Stunden Hörbücher / Monat
  • Podcasts nur bei Podimo
  • Alle kostenlosen Podcasts

Mehr Inside MySQL: Sakila Speaks

The Inside MySQL, Sakila Speaks podcast is dedicated to all things MySQL. We bring you the latest news from the MySQL team, MySQL product updates, and inciteful interviews with members of the MySQL Community. Sit back and enjoy as your hosts, Fred Descamps and Scott Stroz, bring you the latest updates on your favorite open-source database.

Alle Folgen

15 Folgen

Episode MyVector Magic: Elevating MySQL with AI Search Cover

MyVector Magic: Elevating MySQL with AI Search

Oracle Ace Alkin Tezuysal joins leFred and Scott to introduce the MyVector plugin for MySQL Community Edition, bringing powerful vector search capabilities to your favorite open-source database. Learn how MyVector enables advanced AI and similarity search features, why this matters for modern applications, and how the MySQL community can easily get started. ------------------------------------------------------------- Episode Transcript: 00:00.000 --> 00:25.000 Welcome to Inside MySQL: Sakila Speaks, a podcast dedicated to all things MySQL. We bring you the latest news from the MySQL team, MySQL product updates and insightful interviews with members of the MySQL community. 00:25.000 --> 00:32.000 Sit back and enjoy as your hosts bring you the latest updates on your favorite open source database. Let's get started. 00:32.000 --> 00:37.000 Hello and welcome to Sakila Speaks, the podcast dedicated to MySQL. I'm LeFred. 00:37.000 --> 00:38.000 And I'm Scott Stroz. 00:38.000 --> 00:47.000 Joining us today is Alkin Tezuysal. We know each other for a long time already and Alkin serves as Director of Services at Altinity Inc. 00:47.000 --> 00:55.000 Bringing over 30 years of experience in open source relational databases with deep expertise in MySQL, of course, and ClickHouse. 00:55.000 --> 01:08.000 He co-authored key references works including MySQL Cookbook 4th edition that came in 2022 and Database Design and Modeling with Postgres and MySQL in 2024. 01:08.000 --> 01:21.000 Alkin, you have been honored as MySQL Rockstar in 2023. And since this year, you are also an Oracle Ace Pro for MySQL. Congratulations and welcome to Inside MySQL: Sakila Speaks. 01:21.000 --> 01:23.000 Thank you very much, everyone. 01:23.000 --> 01:34.000 We're glad you're here. Alkin, as you may not know, this season of the podcast is dedicated to all things AI as it relates to MySQL and HeatWave. 01:34.000 --> 01:43.000 And you actually created or wrote a plugin for MySQL Community that kind of helped with that, MyVector. 01:43.000 --> 01:48.000 Can you give us an overview of what MyVector is and what problem it's meant to solve? 01:48.000 --> 01:50.000 Sure. Thank you very much for the question. 01:50.000 --> 02:00.000 And I'm very happy that this year of AI and HeatWave, everything that actually contributes to this technology because it's fairly new. 02:00.000 --> 02:06.000 It's been developing for many years, as we already know, but now it's in our hands. 02:06.000 --> 02:16.000 We can use it. We can definitely use it on our day-to-day activities, whether it's troubleshooting your dishwasher or your washing machine. 02:16.000 --> 02:20.000 But we could also use it in a business-wise database. 02:20.000 --> 02:29.000 So one correction I want to make is I am a contributor to MyVector plugin, not to author. 02:29.000 --> 02:34.000 The author is Shankar Iyer, and he's a developer for databases for many years. 02:34.000 --> 02:40.000 He's got a lot of experience where I've actually been presenting and supporting this project. 02:40.000 --> 02:49.000 And that's the small correction. Other than that, MyVector is a native plugin for MySQL that adds support for storing and searching high dimensional vectors. 02:49.000 --> 02:55.000 This is basically a very, in simple terms, what it does. 02:55.000 --> 03:00.000 And this has been in development for some time. 03:00.000 --> 03:14.000 And as we have seen other, you know, databases, other open source databases also went into this with the, you know, launching of AI to our, you know, end users. 03:14.000 --> 03:24.000 Adding approximate nearest neighbor n-search directly in SQL within MySQL database was kind of needed. 03:24.000 --> 03:29.000 And there has been similar implementations with MySQL. 03:29.000 --> 03:33.000 But MyVector is the open source version of that as a plugin. 03:33.000 --> 03:39.000 So just to wrap up that answer is MyVector column type for embedding storage. 03:39.000 --> 03:41.000 And there's a MyVector. 03:41.000 --> 03:46.000 There's a bunch of functions that MyVector distance for the similarity competition. 03:46.000 --> 03:50.000 Of course, it uses HNSW-based index algorithm, which is very popular. 03:50.000 --> 03:52.000 There's a white paper around it. 03:52.000 --> 04:01.000 It's not a rocket science or just something that was invented for MyVector that is known science. 04:01.000 --> 04:06.000 And basically, it provides an SQL native interface within MySQL. 04:06.000 --> 04:08.000 Hope that answers that question. 04:08.000 --> 04:10.000 Thank you very much, Alkin, yeah. 04:10.000 --> 04:22.000 It answers everything and very happy that you also, let's say, talk about the author that we already met also in Belgium recently. 04:22.000 --> 04:31.000 So I would like to ask you, so why is it important to have this similarity search indexes in MySQL then? 04:31.000 --> 04:40.000 Yeah. So again, going back to the AI-driven application, semantic search, product recommendation, question and answering, anomaly detection, etc. 04:40.000 --> 04:43.000 These really require a similarity searches. 04:43.000 --> 04:47.000 Have we done similarity searches in the past? Yes, we have. 04:47.000 --> 04:52.000 If you remember, this is a long, long time ago, but those technologies are still in effect. 04:52.000 --> 05:03.000 And we had search indexes like the Solr, this Phoenix, if you recall those, where we used to have a replica, generate index and search for it. 05:03.000 --> 05:10.000 I used to work for an e-commerce site and users would search for a product. 05:10.000 --> 05:15.000 And then we would also display the similar products. 05:15.000 --> 05:23.000 And in order to do that in MySQL, we had to use external services like, like I said, some search. 05:23.000 --> 05:25.000 So it is very important. 05:25.000 --> 05:29.000 But with the AI-driven application, it's not important anymore. 05:29.000 --> 05:30.000 It's a must have. 05:30.000 --> 05:35.000 Basically, you don't need to run a separate vector database. 05:35.000 --> 05:45.000 And basically, if the data is already in MySQL, you could use this technology using, you know, similarity search functionalities. 05:45.000 --> 05:49.000 Back at FOSDEM, you gave a presentation about MyVector. 05:49.000 --> 05:55.000 And over the weekend at FOSDEM, there were a lot of other sessions about vector and indexes. 05:55.000 --> 06:01.000 Has MyVector made any significant changes since you last talked about it in public? 06:01.000 --> 06:07.000 Yes, there was another public talk after FOSDEM that was a vector search conference. 06:07.000 --> 06:14.000 And we've had a bunch of talks about vector searches, vector technologies, which was around this open source databases, including MySQL. 06:14.000 --> 06:19.000 There were, I think, four or five MySQL talks around the vector search. 06:19.000 --> 06:33.000 From the development side, yes, there's one important improvement that was made that was the necessary support for binary distributions other than the Docker images. 06:33.000 --> 06:43.000 So we worked on those and built, you know, three different versions of MySQL binary distributions for testing, because it's more like a DIY. 06:43.000 --> 06:51.000 And you have to compile and everyone is not very competent enough or have enough time to compile MySQL. 06:51.000 --> 07:02.000 So we built images for 8.0 and 8.4 and 9x versions for easy testing. 07:02.000 --> 07:12.000 And there were some improvements on performance and index stability, of course, and so that's about it. 07:12.000 --> 07:18.000 Maybe it doesn't sound a lot, but this is a lot of work, basically, considering it's an open source project. 07:18.000 --> 07:21.000 Yeah, thank you. I can imagine it's a lot of work. 07:21.000 --> 07:31.000 So let's go now in the more technical, let's dig a bit in technical and a bit deeper there. 07:31.000 --> 07:41.000 So you said earlier that MyVector is using this HNSW, which is a hierarchical navigable small world indexes, right? 07:41.000 --> 07:48.000 Why was this type chosen over other or over alternatives? 07:48.000 --> 07:55.000 And do you know if or you yourself have tried alternatives or not? 07:55.000 --> 07:59.000 We would like to know a bit more about why that choice. 07:59.000 --> 08:01.000 That's a great question, actually. 08:01.000 --> 08:11.000 And when we first all heard or started knowing about this HNSW, hierarchical navigable small word for the n-search, like approximate nearest neighbor search. 08:11.000 --> 08:21.000 That was, it sounded like when I did my research and started reading about it, I think we met with you in London last year. 08:21.000 --> 08:26.000 We were talking about this, you know, the n-search and everything else. 08:26.000 --> 08:33.000 This is basically, I thought it was more like a de facto standard of the n-search. 08:33.000 --> 08:44.000 And it turned out to be that way because a lot of the other open source databases or implementations were circling around HNSW. 08:44.000 --> 08:49.000 And that's not to say that there are not other options out there. 08:49.000 --> 09:00.000 But usually when technologies like this launched, you don't go and reinvent the wheel, but basically build upon an existing technology. 09:00.000 --> 09:09.000 Since HNSW was widely available in terms of a knowledge wise, it was chosen HNSW. 09:09.000 --> 09:13.000 And, you know, it has high accuracy. 09:13.000 --> 09:16.000 It's a, it's got support for dynamic inserts and leads. 09:16.000 --> 09:19.000 And, and it has an efficient memory usage. 09:19.000 --> 09:21.000 These are the top three things that I know about it. 09:21.000 --> 09:31.000 But, you know, you know, from the other open source databases, like I said, the benchmarking were all circling around this. 09:31.000 --> 09:41.000 And if you were to use a different indexing, it would be very difficult to compare apple to apple from a different indexing perspective. 09:41.000 --> 09:52.000 So, I think, again, I'm, I'm not saying there are no other and methods there are, but they might be less accurate. 09:52.000 --> 09:54.000 They may have different, options. 09:54.000 --> 10:04.000 but if you want to kind of, play something in the market that everybody knows, it would be better off, using the known, methodologies. 10:04.000 --> 10:11.000 So you've given me something I need to look up so I know what I'm going to be doing over the weekend, which HNSW. 10:11.000 --> 10:18.000 so does the data that we use need to be trained before it can be indexed? 10:18.000 --> 10:20.000 No, there's no training. 10:20.000 --> 10:23.000 Basically it's the, it's the, embeddings. 10:23.000 --> 10:30.000 The, the, the difference between the training and the embeddings is you just need to generate the embeddings. 10:30.000 --> 10:55.000 And that's where, that's where an additional step, like if your data is already in the database and you, you want to use this, vector, search technology using HNSW indexing for the n-search, you need to generate the embeddings, whether externally or internally with, with, with a service or, something like that. 10:55.000 --> 11:18.000 We, I know that there are some, the, the other types of index that are maybe less popular, that, index the embeddings, sometime they also need to, to have some training before, but, yeah, this one doesn't, which is, which is good because every time you want to, to add the data, whatever, it's quite complicated if you want to train it. 11:18.000 --> 11:19.000 Right. 11:19.000 --> 11:20.000 Yeah. 11:20.000 --> 11:25.000 Basically it's, it's, it's generate the embedding, insert in the MySQL and build the HNSW index. 11:25.000 --> 11:38.420 So, as you are discussing about, this, this index, what, what I'm, curious because, I also try, I try and check, different type of flow indexes to, to understand what they do and what it is. 11:38.420 --> 11:46.160 but, I would like to know what's the size of this index compared to the actual size of the data, right? 11:46.440 --> 11:59.440 Because I know, and maybe it's the case, on your implementation that, the full representation of the, of the, of the vector is stored on the index on some of them or most of them. 11:59.440 --> 12:09.620 So I would like to know, if you have made some check there and, if, if the size is compared, right, to the, the, the full embeddings and the index. 12:09.920 --> 12:16.940 Just to recap that, the, the, the full vector is stored inside the index structure on a fast axis. 12:17.540 --> 12:22.560 So, so there's, there's no reference in back or anything like that. 12:22.560 --> 12:46.380 It's in the, this, we were talking about this, the size of the, index is that depends on the, vector dimension dimensions, a number of vectors that we're storing and, and then, and some of the parameters that, you know, per node, that, that index, but, we did some, some sizing and testing around it as yes. 12:46.380 --> 12:54.160 The accuracy increases when the dimensions are high as we know, and the size, size gets, gets higher. 12:54.440 --> 13:04.800 So, we're looking into this also, if there is any, any option to optimize that or, use some compression technology to, for this index. 13:05.160 --> 13:16.360 And, that's, something, is, is kind of, important to know that because this is not in, you know, DB, this is basically in the file system. 13:16.380 --> 13:19.080 And it needs to be, you know, placed correctly. 13:19.360 --> 13:32.200 You, you mentioned before how the, how MyVector is available and I know it's available as a plugin and just to clarify for our listeners, do you also provide the binary packages or do we still need to compile it, from the source? 13:32.980 --> 13:33.480 Yes. 13:33.480 --> 13:46.860 As I mentioned earlier, since FOSDEM the, the, the latest, release of, MyVector included, the, binary releases or, x86. 13:47.480 --> 13:53.840 And, and, that will, that is published in the GitHub page and, as open source. 13:54.100 --> 13:57.220 so you no longer need to compile it from the source. 13:57.220 --> 14:02.360 If you want to test it out, just to plug in, my, um, 14:02.360 --> 14:16.980 I have launched a new blog technical blog page and started blogging and, I will blog about this so, so that, the listeners and, and the readers can actually, have a link available in the, in the blog. 14:16.980 --> 14:28.440 So they can go in and test it out and Docker images were available, but, binary releases also added, recently, for the, for the test. 14:28.440 --> 14:30.380 Awesome. Thank you, Alkin. 14:30.720 --> 14:42.720 So last question, do you know if there are already, some, companies or, users, using MyVector in production or not yet? 14:43.260 --> 14:45.500 There are some POCs going on. 14:45.860 --> 14:47.700 And as you know, this is open source. 14:47.700 --> 14:58.360 there, there were some interest after force them, you know, we, people reached out, who were actually doing, this type of research and analysis. 14:58.360 --> 15:03.920 It was on the existing MySQL databases and, we provided help and information. 15:04.380 --> 15:05.540 They might have taken it. 15:05.600 --> 15:06.580 They might have forked it. 15:06.640 --> 15:08.980 They might have embedded into their existing implementation. 15:09.580 --> 15:16.500 we don't know, but, as far as I know, there, there are a few POCs are going, they're testing with the existing data. 15:16.500 --> 15:29.520 So, as you know, just to add up since FOSDEM, there has been one shift happened in this type of technology, which is the MCP servers. 15:29.520 --> 15:45.640 So, that is one thing that I wanted to add over here with that shift, generating embeddings and, and actually having an MCP server that will actually add context to the n-search. 15:45.640 --> 15:53.620 Like a chat bot implementation made it, made things a little bit more, not only useful, but also interesting. 15:54.320 --> 16:01.380 So say you have, you know, support tickets or some, some data that's actually related to your, you know, internal customers. 16:01.580 --> 16:03.600 You could add MCP server. 16:03.800 --> 16:06.160 There are some public MCP servers. 16:06.320 --> 16:09.780 There are some open source and MCP server implementations. 16:10.080 --> 16:12.300 And, and we're also looking into that. 16:12.560 --> 16:15.560 And, I want to mention over here on my last. 16:15.640 --> 16:24.200 talk during the vector search conference, I have actually, presented a MyVector with an MCP server demo. 16:24.200 --> 16:27.880 and, and that recording is should be available. 16:27.880 --> 16:30.900 So for that, and, and, and actually it works. 16:30.900 --> 16:44.520 this is pure MySQL open source, pure MyVector open source and pure MCP server open source with the, you know, clinical trials data, like the, one of the public data data sets that I've used. 16:44.520 --> 16:52.300 you can actually ask questions and it'll answer, and then you can continue the chat using this, this, very given technology. 16:52.300 --> 16:54.060 That's awesome. 16:54.060 --> 17:00.060 I've actually been playing around with, writing my own MCP servers and having them interact with MySQL. 17:00.060 --> 17:14.880 And I think MCP is going to be, is going to wind up being pretty, being pretty big because it does give that, domain specific context that the LLMs can actually use to generate content or answers or whatever. 17:14.880 --> 17:18.120 So I, I'm, I'm interested to play around with that. 17:19.000 --> 17:19.520 Absolutely. 17:19.740 --> 17:20.120 Absolutely. 17:20.120 --> 17:23.100 This is very interesting development, very recent. 17:23.740 --> 17:28.440 a lot of people are experimenting right now with the MCP servers. 17:29.160 --> 17:44.300 MCP servers, are going to be, I think the salt and pepper of, or, or sauce of, of this technology, you know, having the vectors and beddings and the, you know, n-search, HNSW index and the MCP server. 17:44.300 --> 17:47.220 So they, it's going to complete the puzzle in my opinion. 17:47.600 --> 17:50.920 And, and also, like I said, I've already given a demo. 17:51.060 --> 17:53.840 We are also looking into this, this technology. 17:54.120 --> 17:59.220 and, of course there's, that is also still under development. 17:59.740 --> 18:08.980 if you're opening up for a public MCP, then it's actually your, your security and compliance is, is now outside again. 18:08.980 --> 18:16.200 We want to do, if you want to do everything internally, if you want to do everything in your own database, with your own security and compliance. 18:16.700 --> 18:20.660 So that's a game changer to have something, available for yourself. 18:21.260 --> 18:21.660 Excellent. 18:21.840 --> 18:23.140 So thank you very much, Alkin. 18:23.660 --> 18:25.060 thank you for your time. 18:25.460 --> 18:27.740 We know you are on your boat right now, sailing. 18:27.960 --> 18:28.900 So that's awesome. 18:29.180 --> 18:29.580 Thank you. 18:29.640 --> 18:30.420 Thanks a lot, guys. 18:30.600 --> 18:31.120 Thank you, Alkin. 18:31.180 --> 18:33.120 And thank you for all your contributions to the community. 18:33.120 --> 18:36.880 That's a wrap on this episode of Inside MySQL: Sakila Speaks. 18:37.040 --> 18:38.280 Thanks for hanging out with us. 18:38.560 --> 18:42.320 If you enjoyed listening, please click subscribe to get all the latest episodes. 18:42.600 --> 18:45.540 We would also love your reviews and ratings on your podcast app. 18:45.860 --> 18:50.080 Be sure to join us for the next episode of Inside MySQL: Sakila Speaks.

18. Sept. 2025 - 19 min
Episode Homegrown Intelligence: AI Features for On-Prem MySQL Enterprise Cover

Homegrown Intelligence: AI Features for On-Prem MySQL Enterprise

leFred and Scott sit down with Gaurav Chadha to explore MySQL AI, a new solution that brings advanced AI features available in HeatWave to organizations running MySQL Enterprise Edition on-premises. Discover how MySQL AI bridges the gap between cloud innovation and on-premise infrastructure, making transformative AI capabilities more accessible, secure, and efficient for teams that rely on MySQL Enterprise Edition wherever their databases reside. -------------------------------------------------------------- Episode Transcript: 00:00.000 --> 00:25.000 Welcome to Inside MySQL: Sakila Speaks, a podcast dedicated to all things MySQL. We bring you the latest news from the MySQL team, MySQL product updates and insightful interviews with members of the MySQL community. 00:25.000 --> 00:32.000 Sit back and enjoy as your hosts bring you the latest updates on your favorite open source database. Let's get started. 00:32.000 --> 00:37.000 Welcome back to another episode of Inside MySQL: Sakila Speaks. Hi, I'm LeFred. 00:37.000 --> 00:38.000 And I'm Scott Stroz. 00:38.000 --> 00:41.000 Today, we are thrilled to have Guarav Chadha joining us. 00:41.000 --> 00:51.000 Guarav is a Senior Development Manager leading development of MySQL HeatWave Lakehouse with a keen interest in systems, machine learning and computer architecture. 00:51.000 --> 01:10.000 Guarav brings a multifaceted expertise to database technology. Following the completion of his PhD from the University of Michigan, Ann Arbor, Guarav started at Oracle Labs in 2016, working on a research project which eventually graduated into MySQL HeatWave. 01:10.000 --> 01:16.000 But today we will talk with him about MySQL and AI on premise. Welcome Guarav. 01:16.000 --> 01:17.000 Thanks, Fred. Hi, Scott. 01:17.000 --> 01:18.000 Hi, Guarav. How are you? 01:18.000 --> 01:19.000 Doing good. 01:19.000 --> 01:32.000 So we're going to dive right in. And AI, we see AI is taking over the world. It's being touted for the solution to everything. 01:32.000 --> 01:41.000 How do you see AI transforming traditional on-premise database environments, especially in enterprise setups? 01:41.000 --> 01:54.000 Yes, Scott. So, I completely agree. AI is a transformational technology, and it has the potential to improve everything that we see around us. 01:54.000 --> 02:07.000 So, with regards to traditional on-premise database environments, especially in enterprise setups, I see multiple categories here. So, AI is a technology and a toolset. 02:07.000 --> 02:32.000 And like many other operators in databases, it can help with more and different data analysis. So, think of AI as a new set of SQL operators, which can tease out or analyze data and derive insights that are hard to do it with other operators, with other analysis tools. 02:32.000 --> 02:45.000 And hard for folks to call up. And hard for folks to code up. And that's where I think AI enhances it very easily enters into the database environments. 02:45.000 --> 02:56.000 What I mean by that is examples are recommendation systems, anomaly detection, so on and so forth. 02:56.000 --> 03:02.000 The other category is what I would say user assistance. 03:02.000 --> 03:15.000 So, not everyone is a SQL expert. And we want database technology and databases to be accessible to more people who may or may not come from a traditional database background. 03:15.000 --> 03:22.000 And SQL is a very powerful language and where it can be daunting to start with. 03:22.000 --> 03:35.000 So, again, this is a general category where maybe folks who are not very familiar with a specific programming language like SQL could write things out in just plain natural text. 03:35.000 --> 03:42.000 And AI tools could translate this into a programmatic interface or programmatic language or SQL directly. 03:42.000 --> 03:50.000 And that's another facet where I think AI can make database systems more approachable to a larger category of folks. 03:50.000 --> 03:57.000 It can also give you more user friendly responses, like instead of saying, oh, here's the error code, something went wrong. 03:57.000 --> 04:00.000 It can give you more information, more user friendly responses. 04:00.000 --> 04:06.000 So those are some examples of where I would say the second category, user assistance. 04:06.000 --> 04:12.000 The third category of where AI could help is database management. 04:12.000 --> 04:21.000 So databases are systems of record, the sources of truth and have a very high bar of staying up and being available. 04:21.000 --> 04:30.000 AI can help schedule maintenance at the right time where maybe the workload is low. 04:30.000 --> 04:35.000 They can predict things that might get slow. 04:35.000 --> 04:43.000 We have a whole area called predictive maintenance and make databases more highly available, more easily approachable. 04:43.000 --> 04:44.000 Thank you. 04:44.000 --> 04:46.000 This sounds very interesting. 04:46.000 --> 04:50.000 And because we are talking about MySQL on-prem, right? 04:50.000 --> 05:04.000 So from these categories, what features could we expect then one day to see in MySQL enterprise with AI or for AI? 05:04.000 --> 05:07.000 So what can you tell us about that? 05:07.000 --> 05:16.000 So for MySQL AI, we are bringing a whole host of AI features to on-premise MySQL deployments. 05:16.000 --> 05:22.000 And we will lean heavily with this first version on the first category, which is data analysis. 05:22.000 --> 05:25.000 How can AI help with data analysis? 05:25.000 --> 05:34.000 And within this, I would focus on, I would say, a few subcategories. 05:34.000 --> 05:48.000 The first is, with AI and generative AI specifically, it has brought the industry a new tool set to search through and understand documents. 05:48.000 --> 05:58.000 And not just structured data or relational data, just plain documents, which is true for a lot of enterprise companies. 05:58.000 --> 06:09.000 Companies have years and years worth of information stored in documents, in PDF documents, in HTML documents, and not really put into a database necessarily. 06:09.000 --> 06:12.000 And this has always been hard to search. 06:12.000 --> 06:23.000 It has been very manual, it has been very hard to bring to a database and perform a very fast and meaningful search. 06:23.000 --> 06:39.000 With generative AI and what we call vector store and vector search, you can search through unstructured data like documents, semantically, instead of just through keywords. 06:39.000 --> 06:42.000 You can search them by meaning. 06:42.000 --> 06:47.000 That's a very powerful technology that we are bringing to MySQL AI. 06:47.000 --> 07:02.000 So if users have documents in their file systems, they can ingest them into the database, and we will automatically create what we call a vector store out of it, which prepares the data in these documents to be searched semantically. 07:02.000 --> 07:13.000 Obviously, in order to this, we are adding a new operator, which does this semantic search, we call this vector distance. 07:13.000 --> 07:31.000 Additionally, I spoke about data analysis tools like recommendation systems, like anomaly detection, and these operators also being brought to MySQL AI, where you can plug them into your logs, or you can plug them into other metrics. 07:31.000 --> 07:40.000 And figure out when things can go wrong, or any other domain that is useful. 07:40.000 --> 07:46.000 An example of a domain for anomaly detection would be financial fraud, credit card fraud. 07:46.000 --> 07:49.000 So it's very useful in those scenarios. 07:49.000 --> 07:56.000 And the last category I would say among data analysis is generative AI. 07:56.000 --> 08:15.000 We're bringing LLMs to MySQL AI, and the power of LLMs really is they can generate new data and new user-friendly text from just bullet points, for instance. 08:15.000 --> 08:22.000 So not just analyzing data, but generating new data is possible through LLMs. 08:22.000 --> 08:29.000 So that I would say covers the first category. 08:29.000 --> 08:31.000 This is all data analysis. 08:31.000 --> 08:34.000 Among the second category, which is user assistance. 08:34.000 --> 08:41.000 User assistance is by bringing LLMs to on-premise MySQL AI deployments. 08:41.000 --> 08:50.000 It gives the user freedom to build more user-friendly applications or make the existing applications more user-friendly. 08:50.000 --> 08:53.000 And this is what we will start with, with version one of MySQL AI. 08:53.000 --> 09:04.000 So are there any specific features in MySQL Enterprise, like Firewall or Enterprise Audit, that support AI-enabled applications? 09:04.000 --> 09:09.000 So as we discussed, AI is an incredibly powerful set of tools and technologies. 09:09.000 --> 09:15.000 And this is our first salvo in enabling our customers to build and augment applications using AI. 09:15.000 --> 09:28.000 So we're bringing a whole tool set, we're bringing faster data analysis, more meaningful and different kinds of data analysis to help users build and augment existing applications. 09:28.000 --> 09:34.000 The door is certainly open to bringing AI to the ecosystem of products, as you mentioned, around the MySQL server. 09:34.000 --> 09:39.000 But with this first version, we are building these right into the MySQL server. 09:39.000 --> 09:54.000 With this MySQL AI, like you call it, right, is it compatible with or will it be compatible with all the architecture solutions that we also provide on-premise, 09:54.000 --> 10:01.000 like such as the InnoDB cluster, the cluster set, replica set, you know, for HA, for disaster recovery? 10:01.000 --> 10:08.000 If somebody goes in that direction, will he be able to keep deploying his MySQL the same way? 10:08.000 --> 10:15.000 So the AI feature set, the tool sets, are built right into the MySQL server. 10:15.000 --> 10:23.000 So all the architecture solutions which are deploying MySQL AI instances benefit from these features. 10:23.000 --> 10:33.000 Okay, so just to be clear, so we're not going to have two distinct products where one's MySQL EE and another one is MySQL AI. 10:33.000 --> 10:35.000 They'll all be together in the same product? 10:35.000 --> 10:41.000 So MySQL AI will be a distinct offering. 10:41.000 --> 10:48.000 It will have everything that Enterprise Edition has, plus the additional AI features we spoke about. 10:48.000 --> 10:51.000 And this is all about user choice. 10:51.000 --> 10:57.000 Customers can continue using MySQL EE if that is what they prefer. 10:57.000 --> 11:10.000 They can switch to MySQL AI and or buy new MySQL AI deployments to try out these new AI tools and get them familiar, get themselves familiar with it. 11:10.000 --> 11:17.000 The MySQL AI will have everything that Enterprise Edition has, plus the AI features. 11:17.000 --> 11:32.000 So it opens the door for users and customers to build their applications worry-free, as they've always built with MySQL EE, because the entire feature set of EE will be present in MySQL AI. 11:32.000 --> 11:37.000 But additionally, they can build more newer things with AI. 11:37.000 --> 11:43.000 In the previous episode, so the other speakers or guests, right? 11:43.000 --> 11:49.000 They extol the virtues of the cloud for AI, our cloud. 11:49.000 --> 11:52.000 Everything was nice and fast and it's good. 11:52.000 --> 12:06.000 And I would like to ask you if there are a performance trade-off between then deploying AI solutions in the cloud using MySQL HeatWave versus on-prem with this new MySQL AI. 12:06.000 --> 12:21.000 So with MySQL AI, we have brought the AI technology, we have built and deployed in the cloud to our EE customers, to on-premise environments. 12:21.000 --> 12:24.000 Cloud has some unquestionable advantages. 12:24.000 --> 12:28.000 Cloud has the benefit of scale-out, which can bring higher performance. 12:28.000 --> 12:37.000 It has GPUs, which can execute LLMs faster or larger LLMs, higher quality LLMs. 12:37.000 --> 12:48.000 So with MySQL AI, we have brought the AI technology that we built and deployed in the cloud to our on-premise customers and our Enterprise Edition customers. 12:48.000 --> 12:57.000 There are a very large, very large set of customers who are on-premise for a variety of reasons. 12:57.000 --> 13:02.000 And we want to serve them where they are. 13:02.000 --> 13:07.000 What I do want to point out is that cloud has some unquestionable advantages. 13:07.000 --> 13:14.000 It has the benefit of scaling out with more and more hardware, which can give you high performance. 13:14.000 --> 13:22.000 And like HeatWave, all these AI features are built to scale out with more resources. 13:22.000 --> 13:31.000 Cloud also has GPUs, which bring more performance for LLMs and can execute larger and higher quality LLMs. 13:31.000 --> 13:47.000 Cloud also has our HeatWave analytics engine, which provides faster analytics performance, allowing users to build combined applications with analytics, OLTP, AI. 13:47.000 --> 13:58.000 What is very important to note, all AI features we bring into MySQL AI are 100% API compatible with HeatWave in the cloud. 13:58.000 --> 14:02.000 So users can build their applications on MySQL AI. 14:02.000 --> 14:15.000 And should they feel the need for higher performance or an expanded feature set, and they want to move to the cloud, the applications will work without modifications. 14:15.000 --> 14:19.000 And we have tried very hard to make it 100% API compatible. 14:19.000 --> 14:40.000 So with MySQL AI, we have optimized inference of open-width LLMs on CPUs, right from one core all the way to 192 cores, using proprietary weight caching and quantization techniques. 14:40.000 --> 14:49.000 And this allows us to, the users to deploy MySQL AI on a range of computer infrastructure, depending on their need. 14:49.000 --> 14:51.000 It can be very small MySQL node. 14:51.000 --> 14:53.000 It can be a very beefy MySQL node. 14:53.000 --> 15:13.000 And as they improve the hardware, the performance, the latency, the quality, the concurrency we deliver increases, providing incentive to users to deploy on larger and larger hardware. 15:13.000 --> 15:25.000 Are we targeting, like, is MySQL AI targeted towards existing MySQL EE customers who are looking for more features? 15:25.000 --> 15:35.000 Or are we kind of targeting other potential customers and luring them in with AI to get them into the MySQL ecosystem? 15:35.000 --> 15:50.000 Both. We are bringing more features to MySQL EE, and we hope that more and more users, more classes of users, more classes of applications, find their home in MySQL. 15:50.000 --> 16:00.000 And absolutely, we want to allow our existing on-premise customers to be able to bring more of their workloads into MySQL. 16:00.000 --> 16:06.000 And for a number of reasons, many customers want to be on-premise. 16:06.000 --> 16:10.000 They might require deployments in the edge devices. 16:10.000 --> 16:13.000 They might require deployments in air-gapped environments for data security. 16:13.000 --> 16:21.000 So we want to enable these existing customers to bring more workloads. 16:21.000 --> 16:29.000 Of course, there are other customers who may not have ever looked at MySQL EE because they have requirements to deploy AI or to deploy generative AI. 16:29.000 --> 16:41.000 And we want to obviously give them this tool set and this enhanced feature set to bring their primary, secondary, tertiary workloads to MySQL EE. 16:41.000 --> 16:46.000 That's very nice to bring this to our on-prem customer. 16:46.000 --> 17:05.000 So correct me if I'm wrong, but to what I understood is that we modified LLMs to work on normal CPU for performance, and so we can run it. 17:05.000 --> 17:20.000 We don't need to have GPUs, but you also said that, yeah, if we really want to use a very large LLM, then it's better to use it to use the cloud than on-prem. 17:20.000 --> 17:33.000 But I wanted to also ask you, because LLMs that are evolving and not at the same speed of the MySQL releases. 17:33.000 --> 17:35.000 I have two questions in this one. 17:35.000 --> 17:42.000 It's like, oh, if there are new LLMs, will it be the possibility to the user to use it directly in GenAI? 17:42.000 --> 17:49.000 Or will it be updated at every new release of MySQL? 17:49.000 --> 18:01.000 Good question. So there's absolutely no exaggeration to state that a new LLM seemingly is released every month with its unique set of characteristics and benefits. 18:01.000 --> 18:20.000 So we will definitely bring new LLMs, which enhance the performance of all quality of results, as and when we feel that is useful to add to the existing set of LLMs we offer with future MySQL releases. 18:20.000 --> 18:31.000 So we will have we have taken the approach of building in LLMs in the package that we ship to our customers. 18:31.000 --> 18:37.000 There are many reasons for this. We have optimized these LLMs to run on CPUs. 18:37.000 --> 18:54.000 So we are able to run larger LLMs faster on CPUs, allowing our users to seamlessly use these LLMs without extra hardware or call outs to other services. 18:54.000 --> 19:02.000 So we will continue doing that with new LLMs as new ones prove to be useful for our users. 19:02.000 --> 19:14.000 What you say is that, OK, when we're going to release a new version of MySQL, if there are new LLMs that were interesting to update, they will be updated at that time, right? 19:14.000 --> 19:19.000 Build the LLMs into the package the users download and deploy. 19:19.000 --> 19:24.000 Users do not need to bring their own LLMs. 19:24.000 --> 19:51.000 Gurav, thank you for joining us today. It really was interesting for me, I can say personally, to learn about some of the advancements that are coming in MySQL AI and how it's going to be integrating with our EE version to allow people who need on premise or prefer on premise installations of MySQL to actually harness some of the AI power that we offer in HeatWave. 19:51.000 --> 19:54.000 Thank you, Scott. Thank you, Fred. 19:54.000 --> 19:55.000 Thank you very much. Bye bye. 19:55.000 --> 20:00.000 That's a wrap on this episode of Inside MySQL: Sakila Speaks. Thanks for hanging out with us. 20:00.000 --> 20:04.000 If you enjoyed listening, please click subscribe to get all the latest episodes. 20:04.000 --> 20:07.000 We would also love your reviews and ratings on your podcast app. 20:07.000 --> 20:12.000 Be sure to join us for the next episode of Inside MySQL: Sakila Speaks.

4. Sept. 2025 - 20 min
Episode Let HeatWave Drive: The AutoPilot Advantage Cover

Let HeatWave Drive: The AutoPilot Advantage

In this episode, leFred and Scott are joined by Onur Kocberber to explore the many features of HeatWave AutoPilot. Learn how AutoPilot's intelligent automation helps manage MySQL instances with ease, optimizes performance, and reduces operational costs. Onur shares practical insights and real-world examples showing how customers can streamline their database operations with HeatWave AutoPilot. ------------------------------------------------------------- Episode Transcript: 00:00:00:00 - 00:00:31:20 Welcome to Inside MySQL: Sakila Speaks. A podcast dedicated to all things MySQL. We bring you the latest news from the MySQL team, MySQL project updates and insightful interviews with members of the MySQL community. Sit back and enjoy as your hosts bring you the latest updates on your favorite open source database. Let's get started! 00:00:31:22 - 00:01:03:00 Hello and welcome to Sakila Speaks, the podcast dedicated to MySQL. I am leFred and I'm Scott Stroz, joining us today is Onur Kocberber. Onur is currently a director of Development at Oracle, leading efforts on MySQL HeatWave, specifically working on the AutoPilot. Based in Oracle's Zurich office, Onur focuses in advanced research and development to improve cloud database performance through interpretable machine learning techniques. 00:01:03:02 - 00:01:24:16 He plays a key role in the ongoing growth of HeatWave, including work on new offering like the HeatWave Lakehouse and HeatWave GenAI service. Welcome, Onur. Thanks. Thanks leFred, thanks Scott. Great to be here. So Onur, can you tell us a bit about your journey? What led you to Oracle and specifically to the MySQL HeatWave team? All right. 00:01:24:16 - 00:01:53:10 So I, I was a grad student at EPFL Lausanne in Switzerland, and, I was doing research specific doing database, accelerators, both for, with hardware and software. And, at the time, I knew that Oracle Labs had a very exciting project about, building basically hardware, software, core design, database machines. And once I graduated, I knew that there were really good set of people. 00:01:53:10 - 00:02:21:18 And that's, how I joined. So I came to basically Zurich, to to the Oracle Labs branch. And then eventually, maybe fast forward ten years, we have, HeatWave database service, but, what we see includes MySQL and other things I will discuss today. That is fantastic. So, Onur, this entire season has been dedicated to, everything AI. 00:02:21:18 - 00:02:47:07 What AI offerings that HeatWave has and some of our listeners, I would guess maybe many of our listeners probably aren't too familiar with, HeatWave AutoPilot. Can you give us a high altitude overview of what AutoPilot is and, what problems that might be resolved? So the database systems today are all cloud databases, right? And, these are many services. 00:02:47:07 - 00:03:21:04 And the onus is on us, in terms of managing these systems. So the customers are expecting basically a full, full fledged, automated service with no, let's say rough edges. And that's where, AutoPilot, comes into play. And when we started the project, when, MySQL HeatWave was becoming a cloud service, we, also started the AutoPilot project, and, we basically targeted four different, let's say, problem domains. 00:03:21:04 - 00:03:53:04 So these are, setting up the system, data, basically loading the data or data management query execution and then failure handling. And, for each of these, categories, we basically looked at what, how we could, improve customer experience as well as customer performance. And at the same time, we put the machine learning, as one of our, basically main objectives because, this is a very old topic, right? 00:03:53:04 - 00:04:18:12 This is this is not a new topic like database management on automatic database, admins and DBAs and such. So that's why we took all the, academic research, plus the realities all today, which is the cloud services. And then, we looked at these four different pillars and then fast forward to today, we have like a double digit numbers in the AutoPilot suite. 00:04:18:14 - 00:04:55:12 Wonderful. And that's awesome. So and why then, this HeatWave AutoPilot is a game changer for users. Right. So, one of the things that we were seeing in the early days of our services that customers would sometimes put together, let's say, scripts or rules or let's say, some sort of, business practices, right? And in AutoPilot, we are taking all of those, especially what you're observing or what you're anticipating, right, that, the customers will have problems with. 00:04:55:16 - 00:05:18:07 And then we are offering them out-of-the-box ready to use for the for the customers. Some of those are fully automated, like, let's say, for or planned improvements. These are like these are happening completely transparent to the use it and some of the features that are a bit more about, the cost optimization of the service or performance optimizations are provided as an advisor. 00:05:18:08 - 00:05:43:03 So essentially we are constantly watching what the customer might, let's say, what would the cost of problems that the customers might have? And we are offering it out of the box included in the, in the service. And that is something, we see when we look at our competitors, we see that, some of the problems that we are solving are just seen as kind of still left as rough, rough edges. 00:05:43:05 - 00:06:02:08 And that's why it is really important. And at the core of it, we have a lot of machine learning models. These models are automatically up to...updated as we also update the version of the service. Therefore customers don't have to worry anything about, basically those, those, those problems that they are running into. Great. 00:06:02:08 - 00:06:31:10 Thank you. So, and when I follow what you just said, then, it seems that, these AutoPilot feature can save OCI customers some money, right? Right. So for certain cases, absolutely. For example, let's take auto provisioning. This is the feature that, the, made available almost at the same time when the, with the GA and, since our GA, this has been used, very actively. 00:06:31:10 - 00:06:54:02 And in this feature, for example, we say this is the number of nodes, that's, a customer should provision for accelerating their, analytical queries with HeatWave. And the great thing here is that, they don't have to overprovision their cluster or they don't, they don't need to under provision their cluster and then run into all sorts of possible issues. 00:06:54:04 - 00:07:13:07 So then one, one part of it is that they have the optimal cost, right? So they, they pay or they provision what they, what they should. And at the same time they also say, save time by just not having to, worry about it. And then similarly, for example, we have an auto load and unload feature. 00:07:13:07 - 00:07:40:05 So if you see there is some let's say there is going to be some benefit from from customer workload, we would automatically load or unload tables. And again, this would either give you a performance boost, which again translates into some sort of cost saving, or at the same time we would just, unload the unnecessary tables so that the customer wouldn't have to, let's say, increase their resource consumption, because they don't they don't have to. 00:07:40:07 - 00:08:15:12 And then we have a bunch of other like, similar features actually, that that will do. For example, there's auto compression that already gives you better price performance, but by default. Right. So that's definitely, every the most of the optimizations we do is translating into some sort of cost saving for the customers. That's awesome. I find that actually pretty, interesting that we offer ways to make sure the customer is basically streamlining their process, and then they're not overpaying for resources because some people might spin up a huge instance when they don't, in fact, need it. 00:08:15:14 - 00:08:39:07 So what are some features of AutoPilot that can help make storing and retrieving data a little bit more efficient? So I mean, let me give you an OLTP example. Of course auto indexing is is one of them. Right. So indexing, is definitely one of the holy grail problems in computer science, I would say. And we have a feature, that basically recommend secondary indexes. 00:08:39:07 - 00:09:04:23 So that's I see people ... people who are familiar with the MySQL know that how important indexes are. So we actually have an index advisor and that's, pretty effective. We see this today with customers as well. And that's just working really well. And having the right indexes is definitely making the, data retrieval, extremely efficient. 00:09:05:01 - 00:09:26:21 And if I were to give you an example from the analytical site, we, we have adaptive query execution. So we are basically over time, the improve the, the the query plan. Right. So this is also making, everything, a lot more efficient. And if I were to give maybe an example from the Lakehouse side. 00:09:26:21 - 00:09:57:14 So this is another, basically feature where we deal with semi-structured data. We do we, we automatically ingest, the unstructured files by understanding the, the, the schema. And, this way we can represent the unstructured data in the right format, which could translate into a better, let's say, space, usage guide so that you don't have to maybe pick a larger type than anticipated, than what the customer anticipated. 00:09:57:16 - 00:10:32:13 So and all these things, are they sometimes they look small, but these are the real problems because, especially when it comes to whether it's indexes or whether it is query plans or whether it is unstructured data, in all these instances, we are dealing with hundreds, if not thousands of either queries or tables and such. And and for a particular user, maybe dealing with 1 or 2 is easy, but dealing with thousands, I think every DBA would know or every user would know that it's, it's it's a tedious process with a lot of gotchas. 00:10:32:13 - 00:10:59:09 And corner cases will be basically take all these things into account in our AutoPilot suite. And then we update our, learnings and our optimizations as the versions go. But thank you. Yeah. Nice. Good answer. So what do you think are, the biggest misconception that the developers have about, machine learning driven, database optimization, right. 00:10:59:11 - 00:11:19:18 Because, yeah, there is the old DBA. Is that the they should know everything. And then the also thwy run the reports and sometimes people say, yeah, is it good or not I don't know. So do you know that, do you have an answer for this. Yes. So this is one of my favorite topics. Yeah. 00:11:19:19 - 00:11:52:18 So this is, something that we, we we have an internal discussion going on. Right. So I am also receiving a lot of requests from other teams or, people who are, and like, very, let's say, excited or ambitious about, applying like, machine learning to, to their domain, to their problems, like, one of the things that I keep seeing is that so there is, basically systems for ML and ML for systems that I, this is, I think, a very good, way of describing. 00:11:52:20 - 00:12:22:20 So we are at the end of the day, we are building computer systems and we should use ML for optimizing our computer systems. So and most of the time what I see is that like people who start, like basically trying to people who start trying to, apply ML, they put ML is a first object, whereas it should be actually not the first objective, it should be first the systems, how we build a system, a computer system. 00:12:22:22 - 00:12:46:22 And then we need to understand what is the hole, right, in our problem space that we can fill with machine learning. So most of the people who go and collect, let's say a data set and a the draw, let's say a regressor or a classifier on that data set. They say that it or it works well in the test, but it doesn't work in the in the real like a production like. 00:12:47:00 - 00:13:09:20 And to me this is the missing a systems inside. So we basically have to have a systems inside. So I will give you a very specific example. For HeatWave for when we are loading data into HeatWave, we can control InnoDB parallel thread and you know, parallel, like a thread count is a known lock that, anybody would know, let's say, how to tune. 00:13:09:22 - 00:13:32:01 But when it comes to the HeatWave load, it is, basically like the trigger behavior is changing, right? So basically we need to understand why the thread break here. The idea is changing. So that means that we have to collect the data in a way that we exercise the parts that HeatWave would exercise now. So if people were basically just saying, oh, like machine learning is going to solve everything for us, right? 00:13:32:01 - 00:13:54:09 Then we start with the data set. It will work. Basically, first the system inside, then the machine learning. You will be highly, highly effective. So that is why I think the second part is a while. Sure that what we put a lot of focus on is interpretability or explainability. So we try to fail our models first before the customers fail them. 00:13:54:10 - 00:14:23:06 alright and with that, we this is we just ship models, right. So this is, this is actually very, very important because otherwise it's some people might say, oh, you know, let's just fancy like a version of rule based tuning or it only works in certain cases. Right? So, basically to summarize, there is a lot of technical debt in machine learning models, but using the systems inside you can get good system engineer. 00:14:23:06 - 00:14:42:02 I think it's it's the real, secret sauce of shipping machine learning models that are effective at production. And then it's a long topic that after you ship it, you have to monitor them. You have to make sure that you are not regressing. There are not too heavy concepts and such. So yeah, what I would change that ML is machine learning is a tool. 00:14:42:04 - 00:15:12:07 And like any tool, it has its own drawbacks. And as long as we are aware of them, we can, these can provide really strong systems, that take advantage of ML. But again, systems first, ML second. That is my philosophy. I think that's actually a pretty good philosophy. So I know you might not be able to tell us everything, but are there any up any upcoming features that you're particularly excited about that you can actually talk about? 00:15:12:09 - 00:15:39:17 All right, so, well, I think this is not a surprise about generative AI. Right. So generative AI is, now the, the, the hottest, topic that, we are dealing with and, we, are working actively on generative AI based AutoPilot features, let's say. And one important difference there is that, when I say systems first, right, systems is about numerical data, right? 00:15:39:17 - 00:16:01:09 We deal with numbers that are coming in, let's say cache misses, buffer pool heat ratios, read write ratios right or performance and like all these numbers that they are essentially a time series that are just flowing in. And machine learning is like traditional machine learning is is very good at it or sometimes categorical, like, the data set, right? 00:16:01:11 - 00:16:23:22 When it happens, like should, it should have been this way or another. Right. Those are easy. But what is generative AI now bringing is, being able to deal with completely unstructured data. So what is unstructured data is text. For example the text. Then what text means means generating SQL code or text means dealing with logs, right. 00:16:23:22 - 00:16:53:15 Or log files or or automatically, thinking into, let's say like, your own, let's say diagnosis. That goes to data insight. So those are the areas, let's say that, that we are working on. Excellent. So because you're talking about, AI, GenAI, the resources is something, we hear, more and more in the database, area. 00:16:53:16 - 00:17:21:12 So, because HeatWave AutoPilot brings us, already a lot of, intelligence, automation but, what about the the natural language to a SQL. So the NL to SQL, do you think, this will be also something that, will come, for us and, and do you see it as a serious productivity tool for the analyst and the developers? 00:17:21:14 - 00:17:52:18 Are there still, orders, to making it available and, enough, for a production use? Right. That's a very good question. Okay. This is definitely a very hot topic for for everyone in the industry, I believe. And, so, yeah. So what is really happening in the NL to SQL domain is, as the large language models are getting larger, we are seeing a very big improvement in the accuracy of these tools. 00:17:52:20 - 00:18:36:02 And of course, what I mean, my accuracy of the known benchmarks. Right. And also what is really... Another interesting trend that is happening is that, you know, you look at traditional, like benchmarks, like for OLTP, OLAP, you know, TPC-C, TPC-H, TPD-DS right? They've been around for a very long time. What is interesting about, this NL to SQL benchmarks is that the more that they're out, it's, it takes maybe a year or less than a year to, to to get really good scores, you know, like, people are conquering is benchmarks and they're very fast, pace so it is true that it was not ready, but now we are seeing 00:18:36:02 - 00:18:58:06 signs that it is actually, they are getting very, very good at it. And and of course, it really depends on your, your complexity, how many SQL constructs you have. Right. And how complex, it could become because you certain SQL is just pages. Right. So that that means you need an assistant. But certain SQL is just, you know, you just want to learn something about your database. 00:18:58:08 - 00:19:17:02 Maybe you're a business executive or maybe you're a data analyst who is not very well versed in SQL. I can tell good for for for those type of use cases where you're a business analyst or whether you're a data analyst, I think that the tools are definitely there. So more complicated queries where even humans don't write in one go, right? 00:19:17:02 - 00:19:41:15 Like its an NL to query it it's pages right there. It's a great system, but it is performing maybe similar to the coding assistance that that we have todat. But yeah, it's it's definitely in a corner like, it like is is basically something that everybody is looking at actively and so, so we are so that's, I think you will hear, 00:19:41:15 - 00:20:04:08 Hopefully, some more cool news about it, soon.. That's awesome. That's actually something I'm really interested in. Just because, like you said, allowing people to query data without actually having the SQL knowledge is, is kind of intriguing. So one last question. As you, as would the three of us should know. You know, here at Oracle, we tend to eat our own dog food. 00:20:04:10 - 00:20:33:12 And it helped improve MySQL in some areas like high availability where we tuned group replication. Do you have a similar experience with the AI related tools that you could, talk about? Oh, yes. That's a very good, question. And this is an active topic that is, basically, happening right now, within the MySQL Org. So, yeah, I'm working on two different projects. 00:20:33:17 - 00:20:59:02 One is, we are using generative AI, our own generative AI service. I think they've generative AI service to generate, HeatWave release notes. So if you go to mysql.com today and if you look for HeatWave release nodes, you will see that there is actually banner up there that says these nodes are generated with the assistant off assistance of, HeatWave generative AI service. 00:20:59:04 - 00:21:20:12 And this is a system that we built in, again, purely running on our, software. And, it's working. Our technical writers love it. And we have actually working on several other improvements that we we are trying to write more, with that. And it is going to come. So there's something of the, quote unquote drinking our own champagne, right. 00:21:20:14 - 00:21:50:14 And, with that also reflected back right to the, to our generative AI team. So the AutoPilot team initially basically said, okay, like, these are the things that you should improve because so if you're running into these problems, our customers will also run into these problems. Right. So that's that. Is there one one one good. Also way of let's say, improving our product and another one, it's, something we call Ask MySQL Expert in short Ask ME. 00:21:50:16 - 00:22:17:17 So we are we is to build a general, question and answer machine that is able to, aanswer questions like the how to questions or the troubleshooting questions internally, like within the within the MySQL org and some of it's actually we, we demoed in some of the keynote speeches that that we gave and the also recently released a version of this as a sample app to our customers. 00:22:17:19 - 00:22:40:05 So basically, one part is that it's a question and answer machine that we are using internally. And, we, we got really good feedback. There are, certain cases where, especially junior, engineers, they learn a lot because they need to onboard, faster, than let's say, you know, basically compared to the compared to the past. 00:22:40:05 - 00:23:08:11 So that is why I think because there's a lot of knowledge to, to carry up to the you to use this tool. And at the same time we release this version to our cust... to our customers. But this one is something that they could bring their own data to build their own. Asked me let's it right? Of course, in this case they could call the tool something different because the case is something they could just, extend so they can just open their own data and, and build this, question and answer, let's say, machine. 00:23:08:13 - 00:23:33:19 And IT will, of course, we are evolving these tools forward so that eventually some of it could be also be part of our, our service. So and, yeah, these are actually two specific examples where we are using generative AI actively. There is a lot more, the only I think, you know, limitation in this case is like, is for us to catch up with this technology, right? 00:23:33:19 - 00:24:00:14 Because the Geratvie AI space is moving really fast and identifying what is working and what is not working and what is useful today. Our what is just forward looking. It is maybe let's say 10% or of of of of my time, like just so that we are not always also working on, very big, let's say, blue sky ideas as opposed to kind of making people's lives easier today. 00:24:00:16 - 00:24:28:03 Thank you very much. So yeah, I saw during the MySQL summit the example of the, of the app, with, the knowledge base, but for example, for the HeatWave release note that I wasn't even aware of. And, after this recording, I will watch it immediately, just for fun. So, to wrap up, I want to I have something in mind that I want to ask you, if you agree with, but, with the MySQL HeatWave service. 00:24:28:03 - 00:24:52:10 Right. We have AI that, so you can build your AI application or, AI anything with the vector search. You you have, you know, HeatWave that will help for you doing AI, but it seems with the AutoPilot and stuff we have also AI that helps, MySQL HeatWave users to improve your experience. Right. 00:24:52:12 - 00:25:28:18 So that's a very good distinction. And I have to admit that sometimes I am also not, let's say making, like, drawing that, like, you're absolutely right. So some of these applications we built are for customers and they could just go and extend it, or they can take inspiration from that and build something else. Right. So that's we're demonstrating our own, technology that, when it comes to AutoPilot, it is essentially another way of, let's say distributing this application, but it's all under control and it is out of our service that it's building to our service. 00:25:28:20 - 00:25:50:19 And you're absolutely right. In both cases, we are using similar, ideas or similar technologies. And, one of them, again, we are giving it to the people so that they could go and extend it the way they like and the others is is our under control. We see how people are using it and we are improving it as as we go along. 00:25:50:19 - 00:26:11:04 That's why there's always a new, let's say, auto feature. I mean, coming out and then sometimes the lines are actually pretty blurry, like, okay, I don't want to make things complicated, but one thing is that we are also working on some intersection where let's say that you have an application that, let's say, uses AutoPilot in a way that let's say a user would interact, but. 00:26:11:04 - 00:26:33:21 Right. So those these are interesting boundaries that we are always, looking. But from the product communication or maybe from the road network perspective, we, we don't really talk too much about this kind of stuff because it's just, maybe it's a little bit more like an intellectual exercise for us to see the limits of our technologies. Thank you very much, Onur, for, taking the time to talk with us. 00:26:33:23 - 00:26:53:10 I thank you, thanks for having me on this. Thank you. Onur. That's a wrap on this episode of inside my exclusive Killer Speaks. Thanks for hanging out with us. If you enjoyed listening, please click subscribe to get all the latest episodes. We would also love your reviews and ratings on your podcast app. Be sure to join us for the next episode of Inside Mysql: 00:26:53:10 - 00:27:05:18 Sakila Speaks.

21. Aug. 2025 - 27 min
Episode HeatWave Hot Takes: The Power of ML and GenAI Cover

HeatWave Hot Takes: The Power of ML and GenAI

In this episode, leFred and Scott welcome Jayant Sharma and Sanjay Jinturkar to the Sakila Studio for an insightful conversation on machine learning and generative AI within HeatWave. Discover how these cutting-edge technologies are integrated, what makes HeatWave unique, and how organizations can leverage its capabilities to unlock new possibilities in data and AI. Tune in for practical insights, real-world use cases, and a closer look at the future of analytics. ------------------------------------------------------------ Episode Transcript: 00:00:00:00 - 00:00:32:01 Welcome to Inside MySQL: Sakila Speaks. A podcast dedicated to all things MySQL. We bring you the latest news from the MySQL team, MySQL project updates and insightful interviews with members of the MySQL community. Sit back and enjoy as your hosts bring you the latest updates on your favorite open source database. Let's get started! 00:00:32:03 - 00:00:54:17 Hello and welcome to Sakila Speaks, the podcast dedicated to MySQL. I am leFred and I'm Scott Stroz. Today for the second episode of season three dedicated on AI. I am pleased to welcome Sanjay Jinturkar. Sorry if I pronounce it badly. No, you did it right. Hi there. Thank you. So Sanjay is the senior director at Oracle based in New Jersey. 00:00:54:19 - 00:01:21:13 He leads product development for it with AutoML and GenAI with a strong focus on integrating these technologies directly into each HeatWave database. And Sanjay has been instrumental in enhancing HeatWave's machine learning and GenAI tool sets, enabling use case like predictive maintenance, fraud detection and intelligent dicument and Q&A. And also we have a second guest today. 00:01:21:13 - 00:01:48:21 It's a Jayant Sharma. Hi, Jayant. Hello. So Jayant Sharma is senior director of product management at Oracle. He has over 20 years of experience in databases, spatial analytics and application development. He's currently focused on the product strategy and design of the Heatwave MySQL managed services offering. Hey Fred. Thank you, both of you for joining us today. So I'm going to dive right in with the question for Jayant. 00:01:48:23 - 00:02:12:14 Why did Oracle decide to integrate machine learning in generative AI capabilities directly into HeatWave? Thank you Scott, first for this opportunity. And yes, we have to start with first, you know, talking about MySQL, right? MySQL is the world's most popular open source database. And what do all of these customers, the thousands of customers that they have, do with it? 00:02:12:16 - 00:02:47:05 They manage a business process. They manage their enterprise, right? Their focus is on what they want to do, why they want to do it, and not so much the how. That's what MySQL makes it easier. And Heatwave is a managed service on MySQL. Okay, so as folks are modernizing their applications, taking advantage of new technology, they want to be able to use new workloads, new analytics, and modernize their business processes, make it more efficient, make it more effective. 00:02:47:07 - 00:03:09:17 In order to do that, they want to do things such as machine learning and use the benefits of generative AI. However, what they want to focus on, as we said, is what they want, why they want to do it and not the how. So they don't want to have to think about. I have all of this data that's potentially a goldmine. 00:03:09:19 - 00:03:40:07 How do I extract nuggets from it, and how do I safely move it and transfer in between the best of breed tools? I want to be able to do things where they are. I want to bring the capabilities, these new capabilities to my data. I don't want to take my data to where those capabilities are exposed, right? That is why we made it possible to do machine learning and GenAI where your gold mine is, where your data is in MySQL in Heatwave. 00:03:40:09 - 00:04:06:07 Awesome. Thank you. So, I would like to ask you to Sanjay, then. How Do the the, machine learning engine in the HeatWave, offer differ from, using external machine learning pipelines with the with the data we have in the database? It differs in a couple of weeks, specifically how the models are built, who builds them and where they are built. 00:04:06:09 - 00:04:46:09 So our pipeline, we provide, automated pipeline, which can take your data in MySQL database or Lakehouse, and then automatically generate the model for you. So it does the, usual tasks of pre-processing, hyperparameter optimization, and, data cleansing, etc. automatically so that the user doesn't have to do that. We would even go ahead and do, explanations for you in certain use cases, given that this is automated, a big side effect of that is users don't need to be experts in machine learning. 00:04:46:11 - 00:05:16:08 What they need to focus on is their business problem, and how that business problem maps onto one of the features that we provide. From there onwards, the pipeline takes over and generates the models for it. And the third piece is that all of this work is done within HeatWave. We don't take the data going back to what Jayant was say, saying, we have got machine learning and generative AI to where the data resides, not the other way around. 00:05:16:10 - 00:05:47:20 So we are building the models inside Heatwave whereby the data is not taken out and thereby it is more secure and the user does not have to worry about data leakage or track where all they have taken the data and how many times they have done it. So these are the three key ways in which we differ. If you use one of the third party solutions, they will end up asking you to do this on your own or asking you to take the data out of the database and build it on your machine, so on and so forth. 00:05:47:22 - 00:06:21:06 But we have made it automated, easy to use and very secure to do so. So Sanjay, we're going to stay with you to, to keep talking about AutoML in HeatWave. So what are some of the key features of AutoML and how does it simplify model training and deployment for users? Fantastic question. You know, as I said in my in the previous, conversation, we are hitting the common tasks that are associated with model training and deployment. 00:06:21:08 - 00:06:46:03 So let's take training here. Typically when the user has to train a model, they are going to take their data. They will clean it up, do some pre-processing. Then they will figure out which particular algorithm they should be using. Tune those algorithms in doing the hyperparameter tuning, so on and so forth. All of these are individual tasks. 00:06:46:05 - 00:07:12:15 Our goal is to have the user focus on their business problem and take away the engineering piece of it, take away the technology piece of it, and do it automatically for them. So we have this pipeline which does this, all of it, all of it automatically in a single pass. So it will do pre-processing. It's going to figure out, the appropriate algorithm to use during model building. 00:07:12:17 - 00:07:39:05 It will figure out what are the best set of hyperparameters and what their values should be, during the training process and give you the, the model. So that's one part the second part is we provide an ability to deploy these models via REST interfaces. So once the model is trained they can deploy this. 00:07:39:07 - 00:08:09:09 And thirdly from time to time the users data is going to drift. Or what I mean by that is the train model. The data on which it was trained no longer reflects the reality. And in that case, you have to retrain the model. So we provide tools to measure that drift. And if it goes beyond a certain threshold, then you can go ahead and retrain your model automatically. 00:08:09:11 - 00:08:53:01 So these are a couple of ways in which we have simplified the model training and the deployment for users. Thank you. Thank you very much for this, detailed, answer. And now... So as we discussed about, you know, the, the data not leaving, to a third party, product. But I would like to, to ask, to, Jayant, if, if there were some performance improvement that, users have seen by doing this, ML natively in HeatWave, instead of removing the data, to external platforms. Certainly, Fred. 00:08:53:03 - 00:09:24:01 So there are two aspects to this. There's, there are efficiencies that, result and there are performance improvement because of the way AutoML is implemented and how it works in HeatWave. Let's start with the efficiency first. The first thing as Sanjay was talking about right, is that we've automated the pipeline. You have to only focus on what is your business problem and how that maps to a particular task in machine learning. 00:09:24:01 - 00:09:47:04 So for example, do I want to predict something. And therefore use regression, do I want to identify or label something and therefore use classification. And AutoML will figure out which particular algorithm. There are multiple ways in which you may do regression, for example, which particular one applies or is best suited for the task at hand. Right. 00:09:47:04 - 00:10:15:06 So efficiency there is AutoML handles it in a single pass, not the normal process requires you to have an iterative do things multiple times. Try it on multiple algorithms or different ways of solving the same problem, and then evaluate which one does it best. AutoML does this in a single pass by. Very smart ways of sampling your data and running quick tests to identify the best approach. 00:10:15:08 - 00:10:35:15 So that's the efficiency. The second when it does this, why is it so fast? It's so fast because it uses it the full capability of the underlying infrastructure, which is the HeatWave nodes. Right. The number of heat wave nodes you've got the size of these HeatWave nodes. It does these things in parallel and fully utilizes the infrastructure. 00:10:35:17 - 00:11:02:22 So what is the benefit of that? You can do things a) faster and b) potentially cheaper, which gives you the luxury of trying multiple what if scenarios. Right. It's not a laborious process. It's more efficient. So if you know exactly what you want, you get it done faster. If you want to try multiple scenarios, you can do that faster and at a lower cost. 00:11:03:00 - 00:11:32:23 So that is the efficiency and the performance enhancements that you get. Awesome. So all right let's switch gears a little bit GenAI is one of the latest additions to HeatWave. What specific GenAI features are currently available or if you can talk about them in development? So, Scott, indeed. GenAI has been one of the latest additions to our platform and frankly it encompasses two separate components. 00:11:33:01 - 00:12:04:05 One is the customer usage part and the second is the technology part. So from a customer usage perspective, what people want to do is bring in their knowledge bases. And by that I mean bring in the PDF documents, PowerPoint documents, so on and so forth, and ask questions of that or get summaries of that text, or translate that text into another language, or develop a chatbot around it so that they can get answers, things of that nature. 00:12:04:07 - 00:12:36:19 So what we have done is keeping this in mind for an enterprise setting. We have developed the technology components which are needed to serve these needs, such that going back to the earlier conversation, they focus on their business needs, and we provide them the tools to actually, serve those or we provide the plumbing to do so. So what we have done is to provide a full pipeline to ingest their documents and create the knowledge base. 00:12:36:19 - 00:13:04:06 And by that I mean bring in your PDF documents, which will get converted into embeddings and stored into vector store. So we provide all of that. And then we provide ways in which to search this knowledge base and give answers to the users via easy to use APIs like retrieval augmented generation (RAG), or doing just semantic search over those documents or doing summarization or translation. 00:13:04:08 - 00:13:32:14 We also have the ability to, support chat. Now, one very interesting thing that we have done is to provide the users a LLMs, which ran on commodity hardware. Jayant was talking about running this on HeatWave nodes. So we, we, we have provided these LLMs which found on commodity hardware so that people can quickly prototype their application to test it out. 00:13:32:16 - 00:14:00:13 And if they like the results of the like the performance, they stick with it. Or if they want high performance, and then they go to our OCI GenAI services and use those LLMs. So, quick prototyping, quick testing, quick evaluation done using the commodity LLMs commodity, the LLMs which are running on commodity hardware. And then they can use the OCI GenAI LLMs to get high performance. 00:14:00:15 - 00:14:26:07 Now going to your question about what newer things are coming in. You know, OCI is at the forefront of this revolution. And they are providing, newer models, newer frameworks and tools. And we are continuously incorporating MySQL and Heatwave with, their tools and technologies so that we can provide the same to our customers, in coming weeks and months. 00:14:26:08 - 00:14:53:23 Yeah. And then an example would be the agent framework. Right. Integrating with the agent framework, integrating with the hosted frontier models on GPU infrastructure. So you develop your prototype, develop, you can choose to deploy. The integration is preexisting. You don't it's not an after the fact exercise. You use the same infrastructure. And we provide the pre-built integration with those AI services. 00:14:54:01 - 00:15:28:22 Excellent. So because you are talking about, using, these, LLMs, on commodity hardware, which model, are available from these LLMs and how are we using them? So, we provide as I mentioned earlier, we provide two sets of, access to two sets pof LLMs. One are the smaller parameters LLMs, which are running on, commodity hardware on the same cluster on which HeatWave is run. 00:15:29:00 - 00:16:00:12 And in that context, we provide models like, mistral 8 billion or, llama 3 billion, 1 billion. And we are continually sort of upgrading these as newer models come into play, since this is a very fast moving fleet. Also, once people have prototyped it, once people have gotten to see the results, if they want to switch over to OCI GenAI LLMs, then we provide access to the wide variety of LLMs that OCI provides. 00:16:00:14 - 00:16:31:11 Those includes, you know, things like all the llama models, the upcoming models from all the other vendors, all of that is going to be accessible through, all of that is accessible through, MySQL. And these can be used for, as I said, but I do have use cases, be it retrieval augmented generation, chat bots, summarization, translation and many other use cases that, you know, customers continue to think of. 00:16:31:11 - 00:16:57:06 And we are always amazed at the way they are using, this technology. So when we talked to Matt Quinn, in our first episode, he had mentioned, he briefly touched on, security and privacy issues when it comes to GenAI or when it comes to AI. And we all know that OCI's, biggest concerns are and always have been, security and privacy. 00:16:57:07 - 00:17:26:12 How is data privacy managed when training models or generating text using GenAI in HeatWave? Thank you for that, Scott. So security and privacy and data privacy are and have always been Oracle's primary concern. Right. It is true for the OCI the infrastructure. And it was built with security in mind. It has been true of Oracle's ever since Oracle since its inception. 00:17:26:14 - 00:18:04:09 So that is core that is our DNA including for MySQL and Heatwave. So the two primary ways in which this is a benefit here. Number one, the data doesn't move. Number two when it does have to move, as you said, in the case of text generation etc., we don't use only send the relevant snippets. So Sanjay talked about the fact is that you just the documents and then you create embeddings, which essentially that means is you look at various snippets, the model creates a semantic, you know, captures the semantics of it. 00:18:04:09 - 00:18:33:08 So that you can do a similarity. Right. Do you want to search for, GenAI? You'll also get documents that talk about retrieval augmented generation. Even though you didn't ask about retrieval augmented generation, because the embeddings know that these two things are related. So we extract the relevant content and only that is then sent. For example, if you want to if you're using in database a LLMs nothing is sent anywhere, everything stays in your environment. 00:18:33:09 - 00:18:58:04 If you're using an OCI service, for example, the relevant pieces sent that you get the answer and OCI is built, as you said, with security in mind. So the relevant piece that you sent is discarded after it has answer your question. So once again, your data doesn't leave the ecosystem. It stays within the Oracle infrastructure. Your infrastructure. Great and secure like we want. 00:18:58:06 - 00:19:29:03 Exactly. So thank you very much. No. The tricky question that, we have every time we present, anything new or compelling, at conference and stuff, it's that. And I would like to ask you if you could share, with us, a few compelling customer stories or a real world application of, the, ML engineand GenAI in HeatWave, because it's bice theoretically. 00:19:29:03 - 00:19:47:23 But are people using it? Really? And who are they? If you can say, I know we cannot always share names, but maybe some, are public. So if you can explain us a bit of that would be great. Yeah, Fred, that's a very good question. And indeed, all of this makes sense if customers are using it. 00:19:48:01 - 00:20:31:14 So, yeah, let me share a couple of them. In fact, let me share two of them, one on GenAI and one on, AutoML, so on GenAI. And this is a public reference. We have a customers, Smarter D their goal, or what they provide is a platform to track the compliance of a company's policy to the guidelines released by various, bodies, for example, DoD. now, in the past, what they have done is to have an auditor, which is going to compare the policy with respect to the guideline and then say if a specific companies are adhering to those guidelines or not, which is kind of 00:20:31:14 - 00:21:04:12 a, manual process, fairly intensive, right? What they did was to start using, Heatwave GenAI and they stored both the policies and the guidelines in HeatWave GenAI's knowledge based. And then using our RAG using our other capabilities, they compared the policy with the guidance automatically. And the results of these are then given to the auditor. 00:21:04:14 - 00:21:28:15 So now what does what does this do. This reduces the amount of time the auditor has to spend comparing the two. Instead, he or she gets a, very short description of what has been done, what has not been done, and whereby they can make a judgment whether it is indeed adhering or not. So it makes the auditor far more efficient compared to earlier. 00:21:28:17 - 00:21:57:04 So that's a that's a use case in the compliance space for HeatWave. GenAI, I, I would give me another example here. I can't, quote the name, but this is in the industrial setting where the customer wants to have early signals of failures of their computer servers. So they are running, large amount of, workloads. 00:21:57:07 - 00:22:39:20 And these workloads generate logs, and those logs sometimes are indicative of failures. So what they do is they use our log anomaly detection, tool models that, they have built actually, and feed their logs continuously to these models. And these models are trained to detect abnormal logs. So when the model notices abnormal logs, it's going to go ahead and give you an early warning of the fact that a particular system is likely to fail. 00:22:39:22 - 00:23:03:14 An example of that would be hey, your CPU utilization is trending higher, or your memory utilization is going up, or the number of network connections are going up. All of this the model best efforts based on the data it was trained earlier and then in real time. It looks at the newer data and figures out, you know what, this is looking problematic. 00:23:03:20 - 00:23:29:22 So let me give you an early warning. And that's what they are doing. So these are two, kind of use cases that some of our customers are two of our customers of, have developed and, use. Great. So just to add to what they were saying on the second case, right, it's high tech manufacturing, their assembly line, the manufacturing process, which is if something goes wrong and that process is interrupted, that's expensive. 00:23:30:00 - 00:23:52:13 Right. And that is what is being, the benefit here is that they get an early warning and they can mitigate, so they get a chance. They have monitoring systems. Obviously, the monitoring system tells you after the fact something went wrong. Go fix it. The log analytics, the predictive part gives you an early warning so you can mitigate the likely. 00:23:52:15 - 00:24:21:14 Yeah, you can either prevent or you can reduce the effect of something getting affected on the assembly line. That is really cool. So it's interesting that, Sanjay, you talked about the compliance issue because when we talked to Matt in the last episode, that was actually an example that he used as well, but he was just talking generally not specific, you know, not a specific, customer or, real world, application of it. 00:24:21:14 - 00:24:50:06 He was just throwing it out there like, this is what you could do. So it's really cool that you brought that up to, to wrap up, what advice would you give to MySQL users who want to start exploring AI with Heatwave? Simple. Get started with the always free instance. So today, and in case y'all aren't aware and folks aren't aware, you can get an always free instance in your own region in OCI. 00:24:50:06 - 00:25:14:15 If you, you know, sign up on OCI account and create an always free your instance. And that actually has the GenAI and AutoML capability. Now to help you get started, we've got live labs that walk you through scenarios. Right. So we've got 4 or 5 live labs - two on GenAI, two on ML and a few on specific use cases. 00:25:14:15 - 00:25:45:16 For example, wholesale SailGP uses machine learning with MySQL to help these people perform better in a race. But more importantly, once you we also have a collection of videos. Some of them by you Scott, for example, on what you can do with AutoML, what you can do with GenAI, what you can do with it in healthcare, what can you do with it in e-commerce and etc.. 00:25:45:16 - 00:26:07:00 So and sample code on GitHub on our GitHub site, sample notebooks, sample SQL code. Try them out on your always free instance. Or if you have access to a beefier instance because you have credits. Try it out on that. And then if you have a specific thing in mind, feel free to reach out to any one of us. 00:26:07:00 - 00:26:25:06 If you want help to do a prototype or do a POC, we'd be more than happy to do that. Nice. Thank you very much. So it was a pleasure to have you both here, Sanjay and Jayant. So thank you very much. To participate to this podcast. So thank you and bye bye. Bye bye. Thank you. Bye bye. Thank you. 00:26:25:06 - 00:26:53:23 Guys, that's a wrap on this episode of MySQL: Sakila Speaks. Thanks for hanging out with us. If you enjoyed listening, please click subscribe to get all the latest episodes. We would also love your reviews and ratings on your podcast app. Be sure to join us for the next episode of MySQL: Sakila Speaks. Episode Transcript: 00:00:00:00 - 00:00:32:01 Unknown Welcome to Inside MySQL: Sakila Speaks. A podcast dedicated to all things MySQL. We bring you the latest news from the MySQL team, MySQL project updates and insightful interviews with members of the MySQL community. Sit back and enjoy as your hosts bring you the latest updates on your favorite open source database. Let's get started! 00:00:32:03 - 00:00:54:17 Unknown Hello and welcome to Sakila Speaks, the podcast dedicated to MySQL. I am leFred and I'm Scott Stroz. Today for the second episode of season three dedicated on AI. I am pleased to welcome Sanjay Jinturkar. Sorry if I pronounce it badly. No, you did it right. Hi there. Thank you. So Sanjay is the senior director at Oracle based in New Jersey. 00:00:54:19 - 00:01:21:13 Unknown He leads product development for it with AutoML and GenAI with a strong focus on integrating these technologies directly into each HeatWave database. And Sanjay has been instrumental in enhancing HeatWave's machine learning and GenAI tool sets, enabling use case like predictive maintenance, fraud detection and intelligent dicument and Q&A. And also we have a second guest today. 00:01:21:13 - 00:01:48:21 Unknown It's a Jayant Sharma. Hi, Jayant. Hello. So Jayant Sharma is senior director of product management at Oracle. He has over 20 years of experience in databases, spatial analytics and application development. He's currently focused on the product strategy and design of the Heatwave MySQL managed services offering. Hey Fred. Thank you, both of you for joining us today. So I'm going to dive right in with the question for Jayant. 00:01:48:23 - 00:02:12:14 Unknown Why did Oracle decide to integrate machine learning in generative AI capabilities directly into HeatWave? Thank you Scott, first for this opportunity. And yes, we have to start with first, you know, talking about MySQL, right? MySQL is the world's most popular open source database. And what do all of these customers, the thousands of customers that they have, do with it? 00:02:12:16 - 00:02:47:05 Unknown They manage a business process. They manage their enterprise, right? Their focus is on what they want to do, why they want to do it, and not so much the how. That's what MySQL makes it easier. And Heatwave is a managed service on MySQL. Okay, so as folks are modernizing their applications, taking advantage of new technology, they want to be able to use new workloads, new analytics, and modernize their business processes, make it more efficient, make it more effective. 00:02:47:07 - 00:03:09:17 Unknown In order to do that, they want to do things such as machine learning and use the benefits of generative AI. However, what they want to focus on, as we said, is what they want, why they want to do it and not the how. So they don't want to have to think about. I have all of this data that's potentially a goldmine. 00:03:09:19 - 00:03:40:07 Unknown How do I extract nuggets from it, and how do I safely move it and transfer in between the best of breed tools? I want to be able to do things where they are. I want to bring the capabilities, these new capabilities to my data. I don't want to take my data to where those capabilities are exposed, right? That is why we made it possible to do machine learning and GenAI where your gold mine is, where your data is in MySQL in Heatwave. 00:03:40:09 - 00:04:06:07 Unknown Awesome. Thank you. So, I would like to ask you to Sanjay, then. How Do the the, machine learning engine in the HeatWave, offer differ from, using external machine learning pipelines with the with the data we have in the database? It differs in a couple of weeks, specifically how the models are built, who builds them and where they are built. 00:04:06:09 - 00:04:46:09 Unknown So our pipeline, we provide, automated pipeline, which can take your data in MySQL database or Lakehouse, and then automatically generate the model for you. So it does the, usual tasks of pre-processing, hyperparameter optimization, and, data cleansing, etc. automatically so that the user doesn't have to do that. We would even go ahead and do, explanations for you in certain use cases, given that this is automated, a big side effect of that is users don't need to be experts in machine learning. 00:04:46:11 - 00:05:16:08 Unknown What they need to focus on is their business problem, and how that business problem maps onto one of the features that we provide. From there onwards, the pipeline takes over and generates the models for it. And the third piece is that all of this work is done within HeatWave. We don't take the data going back to what Jayant was say, saying, we have got machine learning and generative AI to where the data resides, not the other way around. 00:05:16:10 - 00:05:47:20 Unknown So we are building the models inside Heatwave whereby the data is not taken out and thereby it is more secure and the user does not have to worry about data leakage or track where all they have taken the data and how many times they have done it. So these are the three key ways in which we differ. If you use one of the third party solutions, they will end up asking you to do this on your own or asking you to take the data out of the database and build it on your machine, so on and so forth. 00:05:47:22 - 00:06:21:06 Unknown But we have made it automated, easy to use and very secure to do so. So Sanjay, we're going to stay with you to, to keep talking about AutoML in HeatWave. So what are some of the key features of AutoML and how does it simplify model training and deployment for users? Fantastic question. You know, as I said in my in the previous, conversation, we are hitting the common tasks that are associated with model training and deployment. 00:06:21:08 - 00:06:46:03 Unknown So let's take training here. Typically when the user has to train a model, they are going to take their data. They will clean it up, do some pre-processing. Then they will figure out which particular algorithm they should be using. Tune those algorithms in doing the hyperparameter tuning, so on and so forth. All of these are individual tasks. 00:06:46:05 - 00:07:12:15 Unknown Our goal is to have the user focus on their business problem and take away the engineering piece of it, take away the technology piece of it, and do it automatically for them. So we have this pipeline which does this, all of it, all of it automatically in a single pass. So it will do pre-processing. It's going to figure out, the appropriate algorithm to use during model building. 00:07:12:17 - 00:07:39:05 Unknown It will figure out what are the best set of hyperparameters and what their values should be, during the training process and give you the, the model. So that's one part the second part is we provide an ability to deploy these models via REST interfaces. So once the model is trained they can deploy this. 00:07:39:07 - 00:08:09:09 Unknown And thirdly from time to time the users data is going to drift. Or what I mean by that is the train model. The data on which it was trained no longer reflects the reality. And in that case, you have to retrain the model. So we provide tools to measure that drift. And if it goes beyond a certain threshold, then you can go ahead and retrain your model automatically. 00:08:09:11 - 00:08:53:01 Unknown So these are a couple of ways in which we have simplified the model training and the deployment for users. Thank you. Thank you very much for this, detailed, answer. And now... So as we discussed about, you know, the, the data not leaving, to a third party, product. But I would like to, to ask, to, Jayant, if, if there were some performance improvement that, users have seen by doing this, ML natively in HeatWave, instead of removing the data, to external platforms. Certainly, Fred. 00:08:53:03 - 00:09:24:01 Unknown So there are two aspects to this. There's, there are efficiencies that, result and there are performance improvement because of the way AutoML is implemented and how it works in HeatWave. Let's start with the efficiency first. The first thing as Sanjay was talking about right, is that we've automated the pipeline. You have to only focus on what is your business problem and how that maps to a particular task in machine learning. 00:09:24:01 - 00:09:47:04 Unknown So for example, do I want to predict something. And therefore use regression, do I want to identify or label something and therefore use classification. And AutoML will figure out which particular algorithm. There are multiple ways in which you may do regression, for example, which particular one applies or is best suited for the task at hand. Right. 00:09:47:04 - 00:10:15:06 Unknown So efficiency there is AutoML handles it in a single pass, not the normal process requires you to have an iterative do things multiple times. Try it on multiple algorithms or different ways of solving the same problem, and then evaluate which one does it best. AutoML does this in a single pass by. Very smart ways of sampling your data and running quick tests to identify the best approach. 00:10:15:08 - 00:10:35:15 Unknown So that's the efficiency. The second when it does this, why is it so fast? It's so fast because it uses it the full capability of the underlying infrastructure, which is the HeatWave nodes. Right. The number of heat wave nodes you've got the size of these HeatWave nodes. It does these things in parallel and fully utilizes the infrastructure. 00:10:35:17 - 00:11:02:22 Unknown So what is the benefit of that? You can do things a) faster and b) potentially cheaper, which gives you the luxury of trying multiple what if scenarios. Right. It's not a laborious process. It's more efficient. So if you know exactly what you want, you get it done faster. If you want to try multiple scenarios, you can do that faster and at a lower cost. 00:11:03:00 - 00:11:32:23 Unknown So that is the efficiency and the performance enhancements that you get. Awesome. So all right let's switch gears a little bit GenAI is one of the latest additions to HeatWave. What specific GenAI features are currently available or if you can talk about them in development? So, Scott, indeed. GenAI has been one of the latest additions to our platform and frankly it encompasses two separate components. 00:11:33:01 - 00:12:04:05 Unknown One is the customer usage part and the second is the technology part. So from a customer usage perspective, what people want to do is bring in their knowledge bases. And by that I mean bring in the PDF documents, PowerPoint documents, so on and so forth, and ask questions of that or get summaries of that text, or translate that text into another language, or develop a chatbot around it so that they can get answers, things of that nature. 00:12:04:07 - 00:12:36:19 Unknown So what we have done is keeping this in mind for an enterprise setting. We have developed the technology components which are needed to serve these needs, such that going back to the earlier conversation, they focus on their business needs, and we provide them the tools to actually, serve those or we provide the plumbing to do so. So what we have done is to provide a full pipeline to ingest their documents and create the knowledge base. 00:12:36:19 - 00:13:04:06 Unknown And by that I mean bring in your PDF documents, which will get converted into embeddings and stored into vector store. So we provide all of that. And then we provide ways in which to search this knowledge base and give answers to the users via easy to use APIs like retrieval augmented generation (RAG), or doing just semantic search over those documents or doing summarization or translation. 00:13:04:08 - 00:13:32:14 Unknown We also have the ability to, support chat. Now, one very interesting thing that we have done is to provide the users a LLMs, which ran on commodity hardware. Jayant was talking about running this on HeatWave nodes. So we, we, we have provided these LLMs which found on commodity hardware so that people can quickly prototype their application to test it out. 00:13:32:16 - 00:14:00:13 Unknown And if they like the results of the like the performance, they stick with it. Or if they want high performance, and then they go to our OCI GenAI services and use those LLMs. So, quick prototyping, quick testing, quick evaluation done using the commodity LLMs commodity, the LLMs which are running on commodity hardware. And then they can use the OCI GenAI LLMs to get high performance. 00:14:00:15 - 00:14:26:07 Unknown Now going to your question about what newer things are coming in. You know, OCI is at the forefront of this revolution. And they are providing, newer models, newer frameworks and tools. And we are continuously incorporating MySQL and Heatwave with, their tools and technologies so that we can provide the same to our customers, in coming weeks and months. 00:14:26:08 - 00:14:53:23 Unknown Yeah. And then an example would be the agent framework. Right. Integrating with the agent framework, integrating with the hosted frontier models on GPU infrastructure. So you develop your prototype, develop, you can choose to deploy. The integration is preexisting. You don't it's not an after the fact exercise. You use the same infrastructure. And we provide the pre-built integration with those AI services. 00:14:54:01 - 00:15:28:22 Unknown Excellent. So because you are talking about, using, these, LLMs, on commodity hardware, which model, are available from these LLMs and how are we using them? So, we provide as I mentioned earlier, we provide two sets of, access to two sets pof LLMs. One are the smaller parameters LLMs, which are running on, commodity hardware on the same cluster on which HeatWave is run. 00:15:29:00 - 00:16:00:12 Unknown And in that context, we provide models like, mistral 8 billion or, llama 3 billion, 1 billion. And we are continually sort of upgrading these as newer models come into play, since this is a very fast moving fleet. Also, once people have prototyped it, once people have gotten to see the results, if they want to switch over to OCI GenAI LLMs, then we provide access to the wide variety of LLMs that OCI provides. 00:16:00:14 - 00:16:31:11 Unknown Those includes, you know, things like all the llama models, the upcoming models from all the other vendors, all of that is going to be accessible through, all of that is accessible through, MySQL. And these can be used for, as I said, but I do have use cases, be it retrieval augmented generation, chat bots, summarization, translation and many other use cases that, you know, customers continue to think of. 00:16:31:11 - 00:16:57:06 Unknown And we are always amazed at the way they are using, this technology. So when we talked to Matt Quinn, in our first episode, he had mentioned, he briefly touched on, security and privacy issues when it comes to GenAI or when it comes to AI. And we all know that OCI's, biggest concerns are and always have been, security and privacy. 00:16:57:07 - 00:17:26:12 Unknown How is data privacy managed when training models or generating text using GenAI in HeatWave? Thank you for that, Scott. So security and privacy and data privacy are and have always been Oracle's primary concern. Right. It is true for the OCI the infrastructure. And it was built with security in mind. It has been true of Oracle's ever since Oracle since its inception. 00:17:26:14 - 00:18:04:09 Unknown So that is core that is our DNA including for MySQL and Heatwave. So the two primary ways in which this is a benefit here. Number one, the data doesn't move. Number two when it does have to move, as you said, in the case of text generation etc., we don't use only send the relevant snippets. So Sanjay talked about the fact is that you just the documents and then you create embeddings, which essentially that means is you look at various snippets, the model creates a semantic, you know, captures the semantics of it. 00:18:04:09 - 00:18:33:08 Unknown So that you can do a similarity. Right. Do you want to search for, GenAI? You'll also get documents that talk about retrieval augmented generation. Even though you didn't ask about retrieval augmented generation, because the embeddings know that these two things are related. So we extract the relevant content and only that is then sent. For example, if you want to if you're using in database a LLMs nothing is sent anywhere, everything stays in your environment. 00:18:33:09 - 00:18:58:04 Unknown If you're using an OCI service, for example, the relevant pieces sent that you get the answer and OCI is built, as you said, with security in mind. So the relevant piece that you sent is discarded after it has answer your question. So once again, your data doesn't leave the ecosystem. It stays within the Oracle infrastructure. Your infrastructure. Great and secure like we want. 00:18:58:06 - 00:19:29:03 Unknown Exactly. So thank you very much. No. The tricky question that, we have every time we present, anything new or compelling, at conference and stuff, it's that. And I would like to ask you if you could share, with us, a few compelling customer stories or a real world application of, the, ML engineand GenAI in HeatWave, because it's bice theoretically. 00:19:29:03 - 00:19:47:23 Unknown But are people using it? Really? And who are they? If you can say, I know we cannot always share names, but maybe some, are public. So if you can explain us a bit of that would be great. Yeah, Fred, that's a very good question. And indeed, all of this makes sense if customers are using it. 00:19:48:01 - 00:20:31:14 Unknown So, yeah, let me share a couple of them. In fact, let me share two of them, one on GenAI and one on, AutoML, so on GenAI. And this is a public reference. We have a customers, Smarter D their goal, or what they provide is a platform to track the compliance of a company's policy to the guidelines released by various, bodies, for example, DoD. now, in the past, what they have done is to have an auditor, which is going to compare the policy with respect to the guideline and then say if a specific companies are adhering to those guidelines or not, which is kind of 00:20:31:14 - 00:21:04:12 Unknown a, manual process, fairly intensive, right? What they did was to start using, Heatwave GenAI and they stored both the policies and the guidelines in HeatWave GenAI's knowledge based. And then using our RAG using our other capabilities, they compared the policy with the guidance automatically. And the results of these are then given to the auditor. 00:21:04:14 - 00:21:28:15 Unknown So now what does what does this do. This reduces the amount of time the auditor has to spend comparing the two. Instead, he or she gets a, very short description of what has been done, what has not been done, and whereby they can make a judgment whether it is indeed adhering or not. So it makes the auditor far more efficient compared to earlier. 00:21:28:17 - 00:21:57:04 Unknown So that's a that's a use case in the compliance space for HeatWave. GenAI, I, I would give me another example here. I can't, quote the name, but this is in the industrial setting where the customer wants to have early signals of failures of their computer servers. So they are running, large amount of, workloads. 00:21:57:07 - 00:22:39:20 Unknown And these workloads generate logs, and those logs sometimes are indicative of failures. So what they do is they use our log anomaly detection, tool models that, they have built actually, and feed their logs continuously to these models. And these models are trained to detect abnormal logs. So when the model notices abnormal logs, it's going to go ahead and give you an early warning of the fact that a particular system is likely to fail. 00:22:39:22 - 00:23:03:14 Unknown An example of that would be hey, your CPU utilization is trending higher, or your memory utilization is going up, or the number of network connections are going up. All of this the model best efforts based on the data it was trained earlier and then in real time. It looks at the newer data and figures out, you know what, this is looking problematic. 00:23:03:20 - 00:23:29:22 Unknown So let me give you an early warning. And that's what they are doing. So these are two, kind of use cases that some of our customers are two of our customers of, have developed and, use. Great. So just to add to what they were saying on the second case, right, it's high tech manufacturing, their assembly line, the manufacturing process, which is if something goes wrong and that process is interrupted, that's expensive. 00:23:30:00 - 00:23:52:13 Unknown Right. And that is what is being, the benefit here is that they get an early warning and they can mitigate, so they get a chance. They have monitoring systems. Obviously, the monitoring system tells you after the fact something went wrong. Go fix it. The log analytics, the predictive part gives you an early warning so you can mitigate the likely. 00:23:52:15 - 00:24:21:14 Unknown Yeah, you can either prevent or you can reduce the effect of something getting affected on the assembly line. That is really cool. So it's interesting that, Sanjay, you talked about the compliance issue because when we talked to Matt in the last episode, that was actually an example that he used as well, but he was just talking generally not specific, you know, not a specific, customer or, real world, application of it. 00:24:21:14 - 00:24:50:06 Unknown He was just throwing it out there like, this is what you could do. So it's really cool that you brought that up to, to wrap up, what advice would you give to MySQL users who want to start exploring AI with Heatwave? Simple. Get started with the always free instance. So today, and in case y'all aren't aware and folks aren't aware, you can get an always free instance in your own region in OCI. 00:24:50:06 - 00:25:14:15 Unknown If you, you know, sign up on OCI account and create an always free your instance. And that actually has the GenAI and AutoML capability. Now to help you get started, we've got live labs that walk you through scenarios. Right. So we've got 4 or 5 live labs - two on GenAI, two on ML and a few on specific use cases. 00:25:14:15 - 00:25:45:16 Unknown For example, wholesale SailGP uses machine learning with MySQL to help these people perform better in a race. But more importantly, once you we also have a collection of videos. Some of them by you Scott, for example, on what you can do with AutoML, what you can do with GenAI, what you can do with it in healthcare, what can you do with it in e-commerce and etc.. 00:25:45:16 - 00:26:07:00 Unknown So and sample code on GitHub on our GitHub site, sample notebooks, sample SQL code. Try them out on your always free instance. Or if you have access to a beefier instance because you have credits. Try it out on that. And then if you have a specific thing in mind, feel free to reach out to any one of us. 00:26:07:00 - 00:26:25:06 Unknown If you want help to do a prototype or do a POC, we'd be more than happy to do that. Nice. Thank you very much. So it was a pleasure to have you both here, Sanjay and Jayant. So thank you very much. To participate to this podcast. So thank you and bye bye. Bye bye. Thank you. Bye bye. Thank you. 00:26:25:06 - 00:26:53:23 Unknown Guys, that's a wrap on this episode of MySQL: Sakila Speaks. Thanks for hanging out with us. If you enjoyed listening, please click subscribe to get all the latest episodes. We would also love your reviews and ratings on your podcast app. Be sure to join us for the next episode of MySQL: Sakila Speaks.

7. Aug. 2025 - 26 min
Episode AI for the Rest of Us: A High-Level Overview Cover

AI for the Rest of Us: A High-Level Overview

Kick off Season 3 of Inside MySQL: Sakila Speaks as leFred and Scott welcome Matt Quinn for an engaging introduction to the world of Artificial Intelligence. In this episode, we step back from the database and explore what AI really is, how it's shaping society and technology, and why it matters to anyone in tech today. Whether you're just curious about AI or eager to understand its key concepts, join us as we break down the basics and set the stage for a season of discovery. ------------------------------------------------------------ Episode Transcript: 00:00:00:00 - 00:00:31:22 Welcome to Inside MySQL: Sakila Speaks. A podcast dedicated to all things MySQL. We bring you the latest news from the MySQL team, MySQL project updates and insightful interviews with members of the MySQL community. Sit back and enjoy as your hosts bring you the latest updates on your favorite open source database. Let's get started! 00:00:32:00 - 00:00:58:22 Hello and welcome to Sakila Speaks, the podcast dedicated to MySQL. I am leFred and I'm Scott Stroz. Join us today. It's Matt Quinn, vice president and head of AI at Orracle. Matt leads how Oracle Cloud Infrastructure's AI services are adopted by customers in EMEA. Matt brings deep expertise in enterprise software strategy and a passion for making AI both powerful and its adoption practical. 00:00:59:00 - 00:01:21:03 Today he is here to help us unpack what GenAI really means for the organizations we work for and buy from, and what it means for developers, data professionals, and MySQL users everywhere. Matt, welcome to Inside MySQL: Sakila Speaks. It's great to have you with us to kick off season three of our podcast. Thank you very much, Fred, Scott, great to be with you. 00:01:21:08 - 00:01:43:21 Looking forward to, to an interesting conversation and getting us going for season three. Awesome. Matt, thanks for being here with us. So right off the bat, when most people hear the term AI, they probably think of chat bots. But that's just one form of AI. Can you help provide us with like a high overview of the different types of AI that exist? 00:01:43:23 - 00:02:15:10 Absolutely. And I think AI and itself is a broad church, right? There's a number of different, kinds of AI. The term actually dates back to the 1950s as a concept for you know, machine thinking. It's had a couple of false dawns over the time when compute and data to train. I wasn't really quite ready for this, but as we got into the 90s and the early noughties, as compute power grew, as storage grew, a confluence of internet accessibility, lots of data becoming available, and then we time fed forward. 00:02:15:12 - 00:02:33:12 We found that organizations could do the fundamentals of what we know of AI today things like machine learning. So learning a trend and a pattern, looking at what happened in the past and do a statistical regression on that to predict some future outcome based on what happened in the past. And we use examples of this today without even knowing it. 00:02:33:12 - 00:02:52:11 You know, is this email that's coming into my email system, is this spam or not spam? Those kind to classifier types of AI have been prevalent for the last ten, 15, 20 years, and we're moving forward to where AI has this more kind of human interaction. It's surfacing and it's suddenly popped into the zeitgeist, for for conversation. 00:02:52:15 - 00:03:14:03 So it has multiple facets. We have machine learning trained something to do, something very specific, show it, something that it's seen before and enable it to predict the future based on what it's learned. But we're starting to see this wave of generative AI do more advanced, more nuanced, more humanlike things, and I think that's a really powerful kind of inflection point that we've seen in the last two, three years. 00:03:14:05 - 00:03:39:02 Thank you. So because in your first, answer, you said you said about the 70s and 90s, but why is I having such a huge moment right now? So what changed since that time? I think that the real inflection point is the the kind of conversational nature of it. You can speak human to it, and it can speak human back to you. 00:03:39:04 - 00:04:01:13 If I think about how compute evolved, you know, it used to be I had to type cryptic commands on the green screen in order to be able to use a computer, which meant the audience of people who could use computer to do something was very limited. In the 80s is the GUI. The graphical user interface kind of emerged suddenly it was a keyboard in a mouse, and the population of people who could interact with the computer was much broader. 00:04:01:15 - 00:04:19:02 Mobile did the same for us, but you still had to learn things. You had to take the human to interact in a way that made sense to the computer. With generative AI, I think what's happened in the last 2 or 3 years is actually the computer is coming to meet the human. Suddenly it's able to interact with us in our language. 00:04:19:07 - 00:04:37:19 I can have a conversation with it. I can ask a question in natural language. Now I might need to engineer my prompts to get the right kind of outcomes to guide it. Actually, the computer understands what I say. It can meet my language and understand that interact with me in a very human way. And I think that's caught the imagination of people. 00:04:37:19 - 00:04:59:18 They've suddenly had this 'aha' moment and that then has gone from, you know, an academic or data or IT kind of problem. It's broken out of it and gone into the board to say, well, actually, what does this mean? How will this work? And as people start to imagine what it could do beyond, you know, asking a question about, you know, what recipe do I have? 00:04:59:18 - 00:05:20:13 Or how can I find an answer to a question I could historic could use a search engine for, but save me some heavy lifting organizations to look at it and say, oh, hang on a minute. What manual processes in my organization...What low value repetitive tasks are happening in my organization that this might help me change? So suddenly AI has gone from being an IT conversation to being a business conversation. 00:05:20:13 - 00:05:48:15 It's it's got the opportunity. It's got the ear of the board. And suddenly that's just pivoted the demand and the interest in AI I think in the last couple of years. That is quite insightful. So because I has become the big thing in the world and everybody is talking about AI, there's got to be some, some common myths or misconceptions about AI out there that you've heard give us one or a couple that you've you've heard that you need to clear up and be like, that's not actually the case. 00:05:48:17 - 00:06:11:19 So there's a couple of things that I think, reoccurring in the conversations I have with customers, with, with engineers, with particularly people outside of IT. And one of those is around privacy. And I think that the challenge that we have with AI is the first services that really burst this into the public domain. There's kind of ChatGPT services. 00:06:11:19 - 00:06:31:04 There's first, opportunity where you could just go to a website, sign up for free, try something for free, engage with it and have a human like conversation. But that spread like wildfire, like 100 million users in a crazy amount of time. The interesting thing there is that free service, and I always like the phrase if something's free, you are the product. 00:06:31:06 - 00:06:54:21 That's those kinds of public sites where it's, you know, it's a consumer-grade service. There's no charge. The huge costs sitting underneath those models, like running the infrastructure, running the applications, having train the models. So the reality is in that environment, the value exchange it was happening is the prompts that I give that free service are available to be used to retrain the model to extend it, to make the product better. 00:06:54:23 - 00:07:24:01 So you're giving access to the data that you provide through a prompt to the service provider that is running that service. That's the value exchange. Now that's created this perception in people's minds that AI isn't private or safe or secure. And I think the reality is, when you do this in an enterprise context, you can absolutely run those models in a ring fenced way, the same way you'd run a database platform where it's isolated. 00:07:24:07 - 00:07:41:09 It's not sending data back to the model provider, it's secure and it's yours. And that enables you to do things. Bring your private data combined with the intelligence that the model has been trained on with public data. And that's what builds builds a system. But it doesn't have to be a system where you're losing control of that data. 00:07:41:13 - 00:08:02:22 So I think there's a lot of FUD around that fear, uncertainty and doubt. And it's up to us as technologists to help dispel the myths and separate where that might be happening in certain domains. That free service is public services. Maybe that is happening, but in an enterprise it scenario, you absolutely can put the security and privacy guardrails around it to meet the kind of enterprise controls that you'd expect. 00:08:03:00 - 00:08:36:10 Whilst reaping the benefits of the AI productivity gains, that you could have. So I think that that to me is the big one. Awesome. Thank you. So, because you said that, you you talked about AI, in industries, and how it's used. And I really like the analogy with the database. So for us, with MySQL, we really enjoy, the databases, could you, paint a picture of how AI is being used across the industries, or is it just specific, or can we use it, in different ways? 00:08:36:10 - 00:08:58:07 And, now it's a great question. I think, like most technological innovations, the thing that is most disruptive about AI is it has an opportunity to be a general purpose technology. And so if I think about things like the internet and electricity before it, electricity is a general purpose technology, right? It's one thing that it is it's ubiquitous in society. 00:08:58:09 - 00:09:21:11 But it's used for many things. Right? It's used for the lights in my room, for the microphone, the router that's routing this, this conversation to you. It's also used to heat my house. It's it's used to generate, to run factories. It's a general purpose technology. The beauty of that is it's power and it's ubiquity and it's up is only constrained by the imagination of people who take electricity and think about what problems could I solve with it? 00:09:21:13 - 00:09:46:16 I think I will be very similar to that. But it's up to us in industry, in technology to invent ways to use this, that are productive, that deliver value for our organizations or for society at large. And the real opportunity there is is boundless, is captured only by our imagination. What I am seeing is there at the very specific first mover type, use cases that are happening. 00:09:46:20 - 00:10:08:10 And they might be things like, you know, drug discovery and protein folding, like highly academic, data science led things. They're moving really fast because those are things where data scientists were already doing lots of work, they were already doing machine learning. They were already up and running with AI. What I'm noticing is that enterprise adoption is a different kind of material, right? 00:10:08:10 - 00:10:30:00 It's a different kind of IT problem to go solve. So what we're seeing is enterprises are experimenting. They're doing lots of pilots. They're they're kind of engaging in, you know, the art of the possible. How could we use this in our organization? What things do we not know about how to do this? We haven't trained our organizational muscle to be able to go from idea to pilot to production yet. 00:10:30:02 - 00:10:51:11 So what I'm seeing is organizations look at human in the loop scenarios. So they're starting with applications where AI is helping a task that already happens to happen a bit more efficiently, a bit more effectively, or drive more coverage. And my favorite example of this was, is in regulated industries where actually, you know, organizations are a bit fearful of upsetting the regulator. 00:10:51:11 - 00:11:10:22 And, you know, we're using AI. And what's the governance challenge with this? I work with a few organizations. You've actually turned that on his head. And what they've said is, how can we use AI to improve our compliance, and regulatory frameworks. So they were looking at this and saying, well, you know, today we have a contact center and we have a team that listen to all the recordings. 00:11:10:22 - 00:11:30:19 Well, actually, they listen to 5%. They sample the recordings and they look for compliance challenges. And then they use that to inform how they educate people and report with compliance, status. So they said, well, actually, why don't we have I listen to all of the calls and then the team that were previously only listening to 5% can go and mark the AI's homework. 00:11:30:21 - 00:11:47:08 And this creates value because now I've improved my compliance perspective on screening all of my phone calls. And the people who are listening to those calls and marking the AI's homework, they can improve and iterate on the model and make it better over time. So we have that human in the loop. So it's augmenting the capability of a team to do something and improving the outcomes for the organization. 00:11:47:14 - 00:12:09:11 I think when you start with use cases that are in that kind of domain, the organization can learn, can adapt and then understand how do I apply this to other problems. And it really has to come from what's the biggest problem in my organization? What's my strategic objective? How does that relate to a data strategy, to an AI strategy to go solve those business problems I want to solve? 00:12:09:14 - 00:12:31:19 And that's the real connective tissue here. It's not a science experiment. It's not AI for the sake of it, just like it wasn't data for the sake of it. It's about data to solve a business problem, help us take action in our organizations. That's awesome. So the three of us obviously work for Oracle, and there's been a lot of news about what Oracle wants to do in terms of AI. 00:12:31:19 - 00:12:58:02 And, you know, are we currently a significant player in the AI world or are we going to get there eventually, do you think or, you know, is it is there is there some other path for Oracle in terms of AI? I think Oracle has a unique position in a number of ways. So if we think about the news that you're talking about yeah, there's lots in the press today about the huge investments that we're making, the giant partnerships that we're doing. 00:12:58:04 - 00:13:21:13 These are about the industrial scale infrastructure that will be needed by organizations, both to train the next generation of these models, but equally to run and inference them. So if you're an organization that wants to consume AI, you want to do that scale. You need that bulletproof, high performance, low latency infrastructure that is secure and robust in order to run the workloads that are powered by AI. 00:13:21:15 - 00:13:43:20 If you're going to do this in an enterprise fashion, you're going to want to do that in a robust, secure, resilient fashion. So building out that infrastructure, the Oracle Cloud infrastructure that we have today, the strong partnerships we have with, GPU providers and software vendors like Nvidia, these are the kind of raw foundational capabilities at absolutely epic scale that are critical to this success 00:13:43:21 - 00:14:21:20 in AI and Oracle's right at the front of that. Interestingly, though, it's not just about tin in data centers. It's about the software stack. It's about the ability to take that raw compute and augment it securely with robust data practices, bring data into the world, bring AI to where that data lives today. That's where I see Oracle being really powerful between huge database platforms from Oracle to relational database platform to MySQL, these are key capabilities and your key software assets that will help organizations unlock the power of that infrastructure and bring it to life in their organization. 00:14:22:00 - 00:14:53:00 And then at the other end of the spectrum, you have, SAS applications at fusion. These are the business process tools, the systems of record, the organizations trust to do work for their organization. They have key elements of data, and they operate they run business processes in your organization. So the ability to surface the outputs of the AI and applications that business users use so they can understand it, use the, you know, interact with the data, glean insights from it, leverage the power of AI to take action for and with them. 00:14:53:02 - 00:15:13:21 That that combination right across the stack, I think is where Oracle is uniquely positioned. And and hence,I am here. Excellent. Yeah. Very nice to see that Oracle actually is a big player in the AI and had the opportunity to see plenty of, of stuff on that tool like the data centers, who we created the, for it. 00:15:14:03 - 00:15:44:00 And, so and yeah, in MySQL, we, we already also see with MySQL HeatWave what brings to to AI there. But, so with that position of of Oracle, going on lot on AI, do you think it will impact, the, the product portfolio, of Oracle, like some stuff to, like, MySQL we know about it, but for other products, do you think that will also impact them? 00:15:44:02 - 00:16:08:19 This, this role of Oracle in the industry? I think it will. I think it will it will bring a new gravity to, to the solutions that we offer. I think the other component, and you're seeing it with MySQL, you see it with Oracle, you know, actually, how do we take the greatness of the database platforms that we have and extend that to simplify organizations use of new technologies? 00:16:08:23 - 00:16:31:13 And you know, my favorite example of this is how do you enable the existing database to do more of the tasks you need in an AI world? So with that, I'm thinking about vectorization, storage of vectors, the ability to run inferencing close to the data. I don't have to pull all my data out of the database just to then run some inferencing over it. 00:16:31:18 - 00:16:58:15 How do I bring that AI capability directly to where the data lives? So I think we're seeing that with lots of the product innovations. And we're also thinking about like what does this mean to governess. You know, if you have a solution where, you know, you've become used to as an organization governing and managing a relational database, how do I then work in a world where I have unstructured data, structured data I have now vectors, it's these are all living in different store data stores. 00:16:58:17 - 00:17:13:12 How do I govern and control that? How do I make sure that I'm keeping that data in in sync? How do I make sure that I've got my GDPR compliance correct? A customer wants to be forgotten. I've now got more places that I need to forget. The customer. I, you know, update that data because it has to be correct. 00:17:13:14 - 00:17:42:02 So I think this concept and we see it across the MySQL platform, we see it across Oracle database. Actually by bringing the vectorization, the vector storage, the vector generation, the the ability to query right into the database engine, you simplify the operational management, you simplify the governance of that model. It makes it easier to secure, to manage access in ways that your organization is already familiar with, by managing a MySQL estate or by managing an Oracle platform. 00:17:42:06 - 00:18:04:09 So so suddenly you're able to expand the scope of the things that you do without it bringing extra operational and governance, complexity into your organization. So it's already influencing our product portfolio. It's already changing the way that we expand to help organizations take advantage of these new needs, these new demands and services, but bring those in a way that makes them part of the existing ecosystem. 00:18:04:09 - 00:18:22:15 They're using the Oracle. And of course, that will continue to evolve in ways that, you know, if I had a crystal ball, I would I'd be looking at what those might look like. But, you know, the key here is that we're moving early, we're moving fast, and we're learning from those, demands and evolving products to help organizations gain the value of that. 00:18:22:15 - 00:18:49:14 They don't have to invent all of these capabilities themselves. They can consume them baked into the products they already used. So I know from the MySQL side that we have customers who have terabytes or petabytes of data. What role is is that data going to play in building or benefiting from AI? And again, I'm talking particularly about like structured data that would be in a MySQL database. 00:18:49:16 - 00:19:12:13 Got it. So so if I think about that, that kind of structured data often that's going to be data that represents entities or processes in your organization. Right. It is the state of a process or of, of an entity, a customer, an order fulfillment, something that exists in the real world projected into a piece of data in that database. 00:19:12:15 - 00:19:54:03 And if we want AI to be a part of how business gets things done, runs a business process, it's going to need to have secure, robust access to well-trusted, grounded data that represents the real world. And I think that key is where AI, in the large language model, the kind of ChatGPT I can interact with it, I can have a conversation with a process that's that's trained and kind of sealed in its data set that it was trained on, but it brings in intelligence that helps it understand a question, interpret language to, perhaps reason over some of the, the, the assets that you've given it as part of that prompt where it becomes 00:19:54:03 - 00:20:16:23 really powerful is in the process is this commonly been referred to as RAG or resource augmented generation. This is the ability to take your private data and securely add it effectively to the prompt. So you add lots of context to the question that you asked the model. Now I can use its intelligence and the ability to understand based on the public data it was trained on. 00:20:17:01 - 00:20:37:21 And in response to the question that you've asked, it can also now answer that ground in your in your own data. So if that data is the structured data, you know, it's about an order, it's about a product description, it's about a fulfillment or an employee. Then suddenly you have the ability to look at that private data set and reason over it using the intelligence from the large language model. 00:20:37:23 - 00:21:04:08 So data will be fundamental because data represents the real world. Data represents the things that we want our business to do. So if you can bring that data, enable it to be composed with that large language model, with the AI, then the AI suddenly can do things in our organization. It can provide insights into our organization. Or if we think more about agenetic AI, it can start to take action, force or recommend actions enable things for us to be done on our behalf. 00:21:04:10 - 00:21:27:17 I think that's where we start to see the flywheel really turn structured data that represents business processes powered by large language models and simplifying the way that that kind of ecosystem, combines. That's where we'll unlock real business value. Enterprise value, versus helping me my homework. That's great. So thank you very much, for that information. It's very insightful. 00:21:27:19 - 00:22:04:15 So yeah, I'm very happy to, to, to, to start this, this third season, with you, Matt, about, AI and we will see in the future, episodes, also in the next one, everything related to and more in with MySQL, of course. But, I think it's very, very interesting time, for people to test AI and, for the people who will listen to us that they can play, on OCI with HeatWave there is a free, HeatWave that has also, AI capabilities. 00:22:04:18 - 00:22:27:11 There is also the, OCI, GenAI service that can be, useful to play with. I, play with the, with both of them. And it's very, very, very interesting and, surprising. Oh. It works. I don't know if you have something else to add for us, but, we were I was very happy to to to chat with you. 00:22:27:13 - 00:22:48:16 But I I'll challenge one thing, Fred, that you said, and I'll be slightly cheeky on it, but playing with it and experimenting, it is step one to learning what can be done, but will only really learn how to do this when we start to practically apply it to real world problems. So we need to move from this kind of experimentation and pilot phase. 00:22:48:20 - 00:23:05:16 That has to happen. And as individuals, as technologists, we will have to do that to learn and get to grips with this technology. But we do need to find ways as organizations to actually do this in anger. And I think, you know, I always use the phrase if you if you want to run a marathon, you start by getting up and running the five K, right. 00:23:05:20 - 00:23:23:02 And it's you do a real run and it hurts. It hurts like hell when when you do that first one. But it becomes easier as you do more of them and you start to expand scope. You start to get longer. You can do bigger runs. Organizations need to do that same piece and train the organizational muscle. You'll do it with real world projects. 00:23:23:04 - 00:23:42:13 We absolutely need to learn how to do this and experiment and learn. But the best way that an organization can, can learn to do this quickly is to find a real world problem to solve and work back from. Why does this organization need to use AI to do this? What problem can it solve for us? And then think about how can AI help us do it? 00:23:42:16 - 00:24:10:08 I think we can get that flywheel going. The playing with it will inspire us. But that's not the end game, right? That's just this chapter one. Yeah. The playing around I think, feeds the, the, the need for lack of a better word that, you know, a lot of times, like I come from a developer background and a lot of times, customers or clients didn't necessarily ask us for what they wanted. 00:24:10:11 - 00:24:36:05 They asked us for what they thought we could deliver. So the Henry Cole thing. Right. I want a faster horse. Exactly. So instead of, you know, saying, hey, this is the problem we have, they're like, well, this is how we think you can solve it. And I think AI is kind of the same thing where people don't really know what the capabilities are, or they're asking for capabilities that they think are the limits of the AI rather than the capabilities that they actually want. 00:24:36:07 - 00:24:52:02 And I think we're going to get to a point probably, sooner rather than later, that we're going to realize that AI can help us with a lot more stuff than what we think it can do right now. Definitely. And it will it will meet in the middle. Right. The business will be saying, I've got these problems I want to solve. 00:24:52:02 - 00:25:17:13 And I think AI, as part of the solution, and developers and technologists who've taken the time and invested the energy to go learn the technology, to see the are the possible, they can then be inspired by the kinds of things that happen when those two meet in the middle. That's where we'll see a real innovation coming in organizations doing really clever things and taking the great products and services that we've built on Oracle Cloud infrastructure in MySQL, in HeatWave, in Oracle database. 00:25:17:14 - 00:25:36:04 The main aim of those is to make it simpler for organizations to take those ideas very quickly, pilot them and prove value. But it's not about piloting them in isolation. We need no cliffs, right? We need to get to the point where when that pilot's ready, we can securely robustly, deliver that into production and we can scale it. 00:25:36:09 - 00:25:56:19 I think doing these, experiments in these enterprise scale, frameworks, in the tools that we provide that gives organizations a route from pilot to production. And that's the bit that I think organizations are really craving. And it's a we we're about to see a real inflection point on that fantastic. Matt, again, thank you for joining us. 00:25:56:21 - 00:26:10:14 I think this has been a great conversation, and I really think that our listeners are going to get a lot out of it, and hopefully it whets their appetite to learn more about AI in upcoming episodes. Thank you very much for having me. Great, great to talk with you. I look forward to listening to all of their story. 00:26:10:16 - 00:26:30:06 Thank you, thank you. That's a wrap on this episode of Inside MySQL: Sakila Speaks. Thanks for hanging out with us. If you enjoyed listening, please click subscribe to get all the latest episodes. We would also love your reviews and ratings on your podcast app. Be sure to join us for the next episode of Inside MySQL: Sakila Speaks.

25. Juli 2025 - 26 min
Super gut, sehr abwechslungsreich Podimo kann man nur weiterempfehlen
Super gut, sehr abwechslungsreich Podimo kann man nur weiterempfehlen
Ich liebe Podcasts, Hörbücher u. -spiele, Dokus usw. Hier habe ich genügend Auswahl. Macht 👍 weiter so

Wähle dein Abonnement

Am beliebtesten

Begrenztes Angebot

Premium

20 Stunden Hörbücher

  • Podcasts nur bei Podimo

  • Keine Werbung in Podimo Podcasts

  • Jederzeit kündbar

2 Monate für 1 €
Dann 4,99 € / Monat

Loslegen

Premium Plus

100 Stunden Hörbücher

  • Podcasts nur bei Podimo

  • Keine Werbung in Podimo Podcasts

  • Jederzeit kündbar

30 Tage kostenlos testen
Dann 13,99 € / monat

Kostenlos testen

Nur bei Podimo

Beliebte Hörbücher

Häufig gestellte Fragen

Weitere Fragen und Antworten
Loslegen

2 Monate für 1 €. Dann 4,99 € / Monat. Jederzeit kündbar.