Inside MySQL: Sakila Speaks

Acerca de Inside MySQL: Sakila Speaks

The Inside MySQL, Sakila Speaks podcast is dedicated to all things MySQL. We bring you the latest news from the MySQL team, MySQL product updates, and inciteful interviews with members of the MySQL Community. Sit back and enjoy as your hosts, Fred Descamps and Scott Stroz, bring you the latest updates on your favorite open-source database.

HeatWave Hot Takes: The Power of ML and GenAI

In this episode, leFred and Scott welcome Jayant Sharma and Sanjay Jinturkar to the Sakila Studio for an insightful conversation on machine learning and generative AI within HeatWave. Discover how these cutting-edge technologies are integrated, what makes HeatWave unique, and how organizations can leverage its capabilities to unlock new possibilities in data and AI. Tune in for practical insights, real-world use cases, and a closer look at the future of analytics. ------------------------------------------------------------ Episode Transcript: 00:00:00:00 - 00:00:32:01 Welcome to Inside MySQL: Sakila Speaks. A podcast dedicated to all things MySQL. We bring you the latest news from the MySQL team, MySQL project updates and insightful interviews with members of the MySQL community. Sit back and enjoy as your hosts bring you the latest updates on your favorite open source database. Let's get started! 00:00:32:03 - 00:00:54:17 Hello and welcome to Sakila Speaks, the podcast dedicated to MySQL. I am leFred and I'm Scott Stroz. Today for the second episode of season three dedicated on AI. I am pleased to welcome Sanjay Jinturkar. Sorry if I pronounce it badly. No, you did it right. Hi there. Thank you. So Sanjay is the senior director at Oracle based in New Jersey. 00:00:54:19 - 00:01:21:13 He leads product development for it with AutoML and GenAI with a strong focus on integrating these technologies directly into each HeatWave database. And Sanjay has been instrumental in enhancing HeatWave's machine learning and GenAI tool sets, enabling use case like predictive maintenance, fraud detection and intelligent dicument and Q&A. And also we have a second guest today. 00:01:21:13 - 00:01:48:21 It's a Jayant Sharma. Hi, Jayant. Hello. So Jayant Sharma is senior director of product management at Oracle. He has over 20 years of experience in databases, spatial analytics and application development. He's currently focused on the product strategy and design of the Heatwave MySQL managed services offering. Hey Fred. Thank you, both of you for joining us today. So I'm going to dive right in with the question for Jayant. 00:01:48:23 - 00:02:12:14 Why did Oracle decide to integrate machine learning in generative AI capabilities directly into HeatWave? Thank you Scott, first for this opportunity. And yes, we have to start with first, you know, talking about MySQL, right? MySQL is the world's most popular open source database. And what do all of these customers, the thousands of customers that they have, do with it? 00:02:12:16 - 00:02:47:05 They manage a business process. They manage their enterprise, right? Their focus is on what they want to do, why they want to do it, and not so much the how. That's what MySQL makes it easier. And Heatwave is a managed service on MySQL. Okay, so as folks are modernizing their applications, taking advantage of new technology, they want to be able to use new workloads, new analytics, and modernize their business processes, make it more efficient, make it more effective. 00:02:47:07 - 00:03:09:17 In order to do that, they want to do things such as machine learning and use the benefits of generative AI. However, what they want to focus on, as we said, is what they want, why they want to do it and not the how. So they don't want to have to think about. I have all of this data that's potentially a goldmine. 00:03:09:19 - 00:03:40:07 How do I extract nuggets from it, and how do I safely move it and transfer in between the best of breed tools? I want to be able to do things where they are. I want to bring the capabilities, these new capabilities to my data. I don't want to take my data to where those capabilities are exposed, right? That is why we made it possible to do machine learning and GenAI where your gold mine is, where your data is in MySQL in Heatwave. 00:03:40:09 - 00:04:06:07 Awesome. Thank you. So, I would like to ask you to Sanjay, then. How Do the the, machine learning engine in the HeatWave, offer differ from, using external machine learning pipelines with the with the data we have in the database? It differs in a couple of weeks, specifically how the models are built, who builds them and where they are built. 00:04:06:09 - 00:04:46:09 So our pipeline, we provide, automated pipeline, which can take your data in MySQL database or Lakehouse, and then automatically generate the model for you. So it does the, usual tasks of pre-processing, hyperparameter optimization, and, data cleansing, etc. automatically so that the user doesn't have to do that. We would even go ahead and do, explanations for you in certain use cases, given that this is automated, a big side effect of that is users don't need to be experts in machine learning. 00:04:46:11 - 00:05:16:08 What they need to focus on is their business problem, and how that business problem maps onto one of the features that we provide. From there onwards, the pipeline takes over and generates the models for it. And the third piece is that all of this work is done within HeatWave. We don't take the data going back to what Jayant was say, saying, we have got machine learning and generative AI to where the data resides, not the other way around. 00:05:16:10 - 00:05:47:20 So we are building the models inside Heatwave whereby the data is not taken out and thereby it is more secure and the user does not have to worry about data leakage or track where all they have taken the data and how many times they have done it. So these are the three key ways in which we differ. If you use one of the third party solutions, they will end up asking you to do this on your own or asking you to take the data out of the database and build it on your machine, so on and so forth. 00:05:47:22 - 00:06:21:06 But we have made it automated, easy to use and very secure to do so. So Sanjay, we're going to stay with you to, to keep talking about AutoML in HeatWave. So what are some of the key features of AutoML and how does it simplify model training and deployment for users? Fantastic question. You know, as I said in my in the previous, conversation, we are hitting the common tasks that are associated with model training and deployment. 00:06:21:08 - 00:06:46:03 So let's take training here. Typically when the user has to train a model, they are going to take their data. They will clean it up, do some pre-processing. Then they will figure out which particular algorithm they should be using. Tune those algorithms in doing the hyperparameter tuning, so on and so forth. All of these are individual tasks. 00:06:46:05 - 00:07:12:15 Our goal is to have the user focus on their business problem and take away the engineering piece of it, take away the technology piece of it, and do it automatically for them. So we have this pipeline which does this, all of it, all of it automatically in a single pass. So it will do pre-processing. It's going to figure out, the appropriate algorithm to use during model building. 00:07:12:17 - 00:07:39:05 It will figure out what are the best set of hyperparameters and what their values should be, during the training process and give you the, the model. So that's one part the second part is we provide an ability to deploy these models via REST interfaces. So once the model is trained they can deploy this. 00:07:39:07 - 00:08:09:09 And thirdly from time to time the users data is going to drift. Or what I mean by that is the train model. The data on which it was trained no longer reflects the reality. And in that case, you have to retrain the model. So we provide tools to measure that drift. And if it goes beyond a certain threshold, then you can go ahead and retrain your model automatically. 00:08:09:11 - 00:08:53:01 So these are a couple of ways in which we have simplified the model training and the deployment for users. Thank you. Thank you very much for this, detailed, answer. And now... So as we discussed about, you know, the, the data not leaving, to a third party, product. But I would like to, to ask, to, Jayant, if, if there were some performance improvement that, users have seen by doing this, ML natively in HeatWave, instead of removing the data, to external platforms. Certainly, Fred. 00:08:53:03 - 00:09:24:01 So there are two aspects to this. There's, there are efficiencies that, result and there are performance improvement because of the way AutoML is implemented and how it works in HeatWave. Let's start with the efficiency first. The first thing as Sanjay was talking about right, is that we've automated the pipeline. You have to only focus on what is your business problem and how that maps to a particular task in machine learning. 00:09:24:01 - 00:09:47:04 So for example, do I want to predict something. And therefore use regression, do I want to identify or label something and therefore use classification. And AutoML will figure out which particular algorithm. There are multiple ways in which you may do regression, for example, which particular one applies or is best suited for the task at hand. Right. 00:09:47:04 - 00:10:15:06 So efficiency there is AutoML handles it in a single pass, not the normal process requires you to have an iterative do things multiple times. Try it on multiple algorithms or different ways of solving the same problem, and then evaluate which one does it best. AutoML does this in a single pass by. Very smart ways of sampling your data and running quick tests to identify the best approach. 00:10:15:08 - 00:10:35:15 So that's the efficiency. The second when it does this, why is it so fast? It's so fast because it uses it the full capability of the underlying infrastructure, which is the HeatWave nodes. Right. The number of heat wave nodes you've got the size of these HeatWave nodes. It does these things in parallel and fully utilizes the infrastructure. 00:10:35:17 - 00:11:02:22 So what is the benefit of that? You can do things a) faster and b) potentially cheaper, which gives you the luxury of trying multiple what if scenarios. Right. It's not a laborious process. It's more efficient. So if you know exactly what you want, you get it done faster. If you want to try multiple scenarios, you can do that faster and at a lower cost. 00:11:03:00 - 00:11:32:23 So that is the efficiency and the performance enhancements that you get. Awesome. So all right let's switch gears a little bit GenAI is one of the latest additions to HeatWave. What specific GenAI features are currently available or if you can talk about them in development? So, Scott, indeed. GenAI has been one of the latest additions to our platform and frankly it encompasses two separate components. 00:11:33:01 - 00:12:04:05 One is the customer usage part and the second is the technology part. So from a customer usage perspective, what people want to do is bring in their knowledge bases. And by that I mean bring in the PDF documents, PowerPoint documents, so on and so forth, and ask questions of that or get summaries of that text, or translate that text into another language, or develop a chatbot around it so that they can get answers, things of that nature. 00:12:04:07 - 00:12:36:19 So what we have done is keeping this in mind for an enterprise setting. We have developed the technology components which are needed to serve these needs, such that going back to the earlier conversation, they focus on their business needs, and we provide them the tools to actually, serve those or we provide the plumbing to do so. So what we have done is to provide a full pipeline to ingest their documents and create the knowledge base. 00:12:36:19 - 00:13:04:06 And by that I mean bring in your PDF documents, which will get converted into embeddings and stored into vector store. So we provide all of that. And then we provide ways in which to search this knowledge base and give answers to the users via easy to use APIs like retrieval augmented generation (RAG), or doing just semantic search over those documents or doing summarization or translation. 00:13:04:08 - 00:13:32:14 We also have the ability to, support chat. Now, one very interesting thing that we have done is to provide the users a LLMs, which ran on commodity hardware. Jayant was talking about running this on HeatWave nodes. So we, we, we have provided these LLMs which found on commodity hardware so that people can quickly prototype their application to test it out. 00:13:32:16 - 00:14:00:13 And if they like the results of the like the performance, they stick with it. Or if they want high performance, and then they go to our OCI GenAI services and use those LLMs. So, quick prototyping, quick testing, quick evaluation done using the commodity LLMs commodity, the LLMs which are running on commodity hardware. And then they can use the OCI GenAI LLMs to get high performance. 00:14:00:15 - 00:14:26:07 Now going to your question about what newer things are coming in. You know, OCI is at the forefront of this revolution. And they are providing, newer models, newer frameworks and tools. And we are continuously incorporating MySQL and Heatwave with, their tools and technologies so that we can provide the same to our customers, in coming weeks and months. 00:14:26:08 - 00:14:53:23 Yeah. And then an example would be the agent framework. Right. Integrating with the agent framework, integrating with the hosted frontier models on GPU infrastructure. So you develop your prototype, develop, you can choose to deploy. The integration is preexisting. You don't it's not an after the fact exercise. You use the same infrastructure. And we provide the pre-built integration with those AI services. 00:14:54:01 - 00:15:28:22 Excellent. So because you are talking about, using, these, LLMs, on commodity hardware, which model, are available from these LLMs and how are we using them? So, we provide as I mentioned earlier, we provide two sets of, access to two sets pof LLMs. One are the smaller parameters LLMs, which are running on, commodity hardware on the same cluster on which HeatWave is run. 00:15:29:00 - 00:16:00:12 And in that context, we provide models like, mistral 8 billion or, llama 3 billion, 1 billion. And we are continually sort of upgrading these as newer models come into play, since this is a very fast moving fleet. Also, once people have prototyped it, once people have gotten to see the results, if they want to switch over to OCI GenAI LLMs, then we provide access to the wide variety of LLMs that OCI provides. 00:16:00:14 - 00:16:31:11 Those includes, you know, things like all the llama models, the upcoming models from all the other vendors, all of that is going to be accessible through, all of that is accessible through, MySQL. And these can be used for, as I said, but I do have use cases, be it retrieval augmented generation, chat bots, summarization, translation and many other use cases that, you know, customers continue to think of. 00:16:31:11 - 00:16:57:06 And we are always amazed at the way they are using, this technology. So when we talked to Matt Quinn, in our first episode, he had mentioned, he briefly touched on, security and privacy issues when it comes to GenAI or when it comes to AI. And we all know that OCI's, biggest concerns are and always have been, security and privacy. 00:16:57:07 - 00:17:26:12 How is data privacy managed when training models or generating text using GenAI in HeatWave? Thank you for that, Scott. So security and privacy and data privacy are and have always been Oracle's primary concern. Right. It is true for the OCI the infrastructure. And it was built with security in mind. It has been true of Oracle's ever since Oracle since its inception. 00:17:26:14 - 00:18:04:09 So that is core that is our DNA including for MySQL and Heatwave. So the two primary ways in which this is a benefit here. Number one, the data doesn't move. Number two when it does have to move, as you said, in the case of text generation etc., we don't use only send the relevant snippets. So Sanjay talked about the fact is that you just the documents and then you create embeddings, which essentially that means is you look at various snippets, the model creates a semantic, you know, captures the semantics of it. 00:18:04:09 - 00:18:33:08 So that you can do a similarity. Right. Do you want to search for, GenAI? You'll also get documents that talk about retrieval augmented generation. Even though you didn't ask about retrieval augmented generation, because the embeddings know that these two things are related. So we extract the relevant content and only that is then sent. For example, if you want to if you're using in database a LLMs nothing is sent anywhere, everything stays in your environment. 00:18:33:09 - 00:18:58:04 If you're using an OCI service, for example, the relevant pieces sent that you get the answer and OCI is built, as you said, with security in mind. So the relevant piece that you sent is discarded after it has answer your question. So once again, your data doesn't leave the ecosystem. It stays within the Oracle infrastructure. Your infrastructure. Great and secure like we want. 00:18:58:06 - 00:19:29:03 Exactly. So thank you very much. No. The tricky question that, we have every time we present, anything new or compelling, at conference and stuff, it's that. And I would like to ask you if you could share, with us, a few compelling customer stories or a real world application of, the, ML engineand GenAI in HeatWave, because it's bice theoretically. 00:19:29:03 - 00:19:47:23 But are people using it? Really? And who are they? If you can say, I know we cannot always share names, but maybe some, are public. So if you can explain us a bit of that would be great. Yeah, Fred, that's a very good question. And indeed, all of this makes sense if customers are using it. 00:19:48:01 - 00:20:31:14 So, yeah, let me share a couple of them. In fact, let me share two of them, one on GenAI and one on, AutoML, so on GenAI. And this is a public reference. We have a customers, Smarter D their goal, or what they provide is a platform to track the compliance of a company's policy to the guidelines released by various, bodies, for example, DoD. now, in the past, what they have done is to have an auditor, which is going to compare the policy with respect to the guideline and then say if a specific companies are adhering to those guidelines or not, which is kind of 00:20:31:14 - 00:21:04:12 a, manual process, fairly intensive, right? What they did was to start using, Heatwave GenAI and they stored both the policies and the guidelines in HeatWave GenAI's knowledge based. And then using our RAG using our other capabilities, they compared the policy with the guidance automatically. And the results of these are then given to the auditor. 00:21:04:14 - 00:21:28:15 So now what does what does this do. This reduces the amount of time the auditor has to spend comparing the two. Instead, he or she gets a, very short description of what has been done, what has not been done, and whereby they can make a judgment whether it is indeed adhering or not. So it makes the auditor far more efficient compared to earlier. 00:21:28:17 - 00:21:57:04 So that's a that's a use case in the compliance space for HeatWave. GenAI, I, I would give me another example here. I can't, quote the name, but this is in the industrial setting where the customer wants to have early signals of failures of their computer servers. So they are running, large amount of, workloads. 00:21:57:07 - 00:22:39:20 And these workloads generate logs, and those logs sometimes are indicative of failures. So what they do is they use our log anomaly detection, tool models that, they have built actually, and feed their logs continuously to these models. And these models are trained to detect abnormal logs. So when the model notices abnormal logs, it's going to go ahead and give you an early warning of the fact that a particular system is likely to fail. 00:22:39:22 - 00:23:03:14 An example of that would be hey, your CPU utilization is trending higher, or your memory utilization is going up, or the number of network connections are going up. All of this the model best efforts based on the data it was trained earlier and then in real time. It looks at the newer data and figures out, you know what, this is looking problematic. 00:23:03:20 - 00:23:29:22 So let me give you an early warning. And that's what they are doing. So these are two, kind of use cases that some of our customers are two of our customers of, have developed and, use. Great. So just to add to what they were saying on the second case, right, it's high tech manufacturing, their assembly line, the manufacturing process, which is if something goes wrong and that process is interrupted, that's expensive. 00:23:30:00 - 00:23:52:13 Right. And that is what is being, the benefit here is that they get an early warning and they can mitigate, so they get a chance. They have monitoring systems. Obviously, the monitoring system tells you after the fact something went wrong. Go fix it. The log analytics, the predictive part gives you an early warning so you can mitigate the likely. 00:23:52:15 - 00:24:21:14 Yeah, you can either prevent or you can reduce the effect of something getting affected on the assembly line. That is really cool. So it's interesting that, Sanjay, you talked about the compliance issue because when we talked to Matt in the last episode, that was actually an example that he used as well, but he was just talking generally not specific, you know, not a specific, customer or, real world, application of it. 00:24:21:14 - 00:24:50:06 He was just throwing it out there like, this is what you could do. So it's really cool that you brought that up to, to wrap up, what advice would you give to MySQL users who want to start exploring AI with Heatwave? Simple. Get started with the always free instance. So today, and in case y'all aren't aware and folks aren't aware, you can get an always free instance in your own region in OCI. 00:24:50:06 - 00:25:14:15 If you, you know, sign up on OCI account and create an always free your instance. And that actually has the GenAI and AutoML capability. Now to help you get started, we've got live labs that walk you through scenarios. Right. So we've got 4 or 5 live labs - two on GenAI, two on ML and a few on specific use cases. 00:25:14:15 - 00:25:45:16 For example, wholesale SailGP uses machine learning with MySQL to help these people perform better in a race. But more importantly, once you we also have a collection of videos. Some of them by you Scott, for example, on what you can do with AutoML, what you can do with GenAI, what you can do with it in healthcare, what can you do with it in e-commerce and etc.. 00:25:45:16 - 00:26:07:00 So and sample code on GitHub on our GitHub site, sample notebooks, sample SQL code. Try them out on your always free instance. Or if you have access to a beefier instance because you have credits. Try it out on that. And then if you have a specific thing in mind, feel free to reach out to any one of us. 00:26:07:00 - 00:26:25:06 If you want help to do a prototype or do a POC, we'd be more than happy to do that. Nice. Thank you very much. So it was a pleasure to have you both here, Sanjay and Jayant. So thank you very much. To participate to this podcast. So thank you and bye bye. Bye bye. Thank you. Bye bye. Thank you. 00:26:25:06 - 00:26:53:23 Guys, that's a wrap on this episode of MySQL: Sakila Speaks. Thanks for hanging out with us. If you enjoyed listening, please click subscribe to get all the latest episodes. We would also love your reviews and ratings on your podcast app. Be sure to join us for the next episode of MySQL: Sakila Speaks. Episode Transcript: 00:00:00:00 - 00:00:32:01 Unknown Welcome to Inside MySQL: Sakila Speaks. A podcast dedicated to all things MySQL. We bring you the latest news from the MySQL team, MySQL project updates and insightful interviews with members of the MySQL community. Sit back and enjoy as your hosts bring you the latest updates on your favorite open source database. Let's get started! 00:00:32:03 - 00:00:54:17 Unknown Hello and welcome to Sakila Speaks, the podcast dedicated to MySQL. I am leFred and I'm Scott Stroz. Today for the second episode of season three dedicated on AI. I am pleased to welcome Sanjay Jinturkar. Sorry if I pronounce it badly. No, you did it right. Hi there. Thank you. So Sanjay is the senior director at Oracle based in New Jersey. 00:00:54:19 - 00:01:21:13 Unknown He leads product development for it with AutoML and GenAI with a strong focus on integrating these technologies directly into each HeatWave database. And Sanjay has been instrumental in enhancing HeatWave's machine learning and GenAI tool sets, enabling use case like predictive maintenance, fraud detection and intelligent dicument and Q&A. And also we have a second guest today. 00:01:21:13 - 00:01:48:21 Unknown It's a Jayant Sharma. Hi, Jayant. Hello. So Jayant Sharma is senior director of product management at Oracle. He has over 20 years of experience in databases, spatial analytics and application development. He's currently focused on the product strategy and design of the Heatwave MySQL managed services offering. Hey Fred. Thank you, both of you for joining us today. So I'm going to dive right in with the question for Jayant. 00:01:48:23 - 00:02:12:14 Unknown Why did Oracle decide to integrate machine learning in generative AI capabilities directly into HeatWave? Thank you Scott, first for this opportunity. And yes, we have to start with first, you know, talking about MySQL, right? MySQL is the world's most popular open source database. And what do all of these customers, the thousands of customers that they have, do with it? 00:02:12:16 - 00:02:47:05 Unknown They manage a business process. They manage their enterprise, right? Their focus is on what they want to do, why they want to do it, and not so much the how. That's what MySQL makes it easier. And Heatwave is a managed service on MySQL. Okay, so as folks are modernizing their applications, taking advantage of new technology, they want to be able to use new workloads, new analytics, and modernize their business processes, make it more efficient, make it more effective. 00:02:47:07 - 00:03:09:17 Unknown In order to do that, they want to do things such as machine learning and use the benefits of generative AI. However, what they want to focus on, as we said, is what they want, why they want to do it and not the how. So they don't want to have to think about. I have all of this data that's potentially a goldmine. 00:03:09:19 - 00:03:40:07 Unknown How do I extract nuggets from it, and how do I safely move it and transfer in between the best of breed tools? I want to be able to do things where they are. I want to bring the capabilities, these new capabilities to my data. I don't want to take my data to where those capabilities are exposed, right? That is why we made it possible to do machine learning and GenAI where your gold mine is, where your data is in MySQL in Heatwave. 00:03:40:09 - 00:04:06:07 Unknown Awesome. Thank you. So, I would like to ask you to Sanjay, then. How Do the the, machine learning engine in the HeatWave, offer differ from, using external machine learning pipelines with the with the data we have in the database? It differs in a couple of weeks, specifically how the models are built, who builds them and where they are built. 00:04:06:09 - 00:04:46:09 Unknown So our pipeline, we provide, automated pipeline, which can take your data in MySQL database or Lakehouse, and then automatically generate the model for you. So it does the, usual tasks of pre-processing, hyperparameter optimization, and, data cleansing, etc. automatically so that the user doesn't have to do that. We would even go ahead and do, explanations for you in certain use cases, given that this is automated, a big side effect of that is users don't need to be experts in machine learning. 00:04:46:11 - 00:05:16:08 Unknown What they need to focus on is their business problem, and how that business problem maps onto one of the features that we provide. From there onwards, the pipeline takes over and generates the models for it. And the third piece is that all of this work is done within HeatWave. We don't take the data going back to what Jayant was say, saying, we have got machine learning and generative AI to where the data resides, not the other way around. 00:05:16:10 - 00:05:47:20 Unknown So we are building the models inside Heatwave whereby the data is not taken out and thereby it is more secure and the user does not have to worry about data leakage or track where all they have taken the data and how many times they have done it. So these are the three key ways in which we differ. If you use one of the third party solutions, they will end up asking you to do this on your own or asking you to take the data out of the database and build it on your machine, so on and so forth. 00:05:47:22 - 00:06:21:06 Unknown But we have made it automated, easy to use and very secure to do so. So Sanjay, we're going to stay with you to, to keep talking about AutoML in HeatWave. So what are some of the key features of AutoML and how does it simplify model training and deployment for users? Fantastic question. You know, as I said in my in the previous, conversation, we are hitting the common tasks that are associated with model training and deployment. 00:06:21:08 - 00:06:46:03 Unknown So let's take training here. Typically when the user has to train a model, they are going to take their data. They will clean it up, do some pre-processing. Then they will figure out which particular algorithm they should be using. Tune those algorithms in doing the hyperparameter tuning, so on and so forth. All of these are individual tasks. 00:06:46:05 - 00:07:12:15 Unknown Our goal is to have the user focus on their business problem and take away the engineering piece of it, take away the technology piece of it, and do it automatically for them. So we have this pipeline which does this, all of it, all of it automatically in a single pass. So it will do pre-processing. It's going to figure out, the appropriate algorithm to use during model building. 00:07:12:17 - 00:07:39:05 Unknown It will figure out what are the best set of hyperparameters and what their values should be, during the training process and give you the, the model. So that's one part the second part is we provide an ability to deploy these models via REST interfaces. So once the model is trained they can deploy this. 00:07:39:07 - 00:08:09:09 Unknown And thirdly from time to time the users data is going to drift. Or what I mean by that is the train model. The data on which it was trained no longer reflects the reality. And in that case, you have to retrain the model. So we provide tools to measure that drift. And if it goes beyond a certain threshold, then you can go ahead and retrain your model automatically. 00:08:09:11 - 00:08:53:01 Unknown So these are a couple of ways in which we have simplified the model training and the deployment for users. Thank you. Thank you very much for this, detailed, answer. And now... So as we discussed about, you know, the, the data not leaving, to a third party, product. But I would like to, to ask, to, Jayant, if, if there were some performance improvement that, users have seen by doing this, ML natively in HeatWave, instead of removing the data, to external platforms. Certainly, Fred. 00:08:53:03 - 00:09:24:01 Unknown So there are two aspects to this. There's, there are efficiencies that, result and there are performance improvement because of the way AutoML is implemented and how it works in HeatWave. Let's start with the efficiency first. The first thing as Sanjay was talking about right, is that we've automated the pipeline. You have to only focus on what is your business problem and how that maps to a particular task in machine learning. 00:09:24:01 - 00:09:47:04 Unknown So for example, do I want to predict something. And therefore use regression, do I want to identify or label something and therefore use classification. And AutoML will figure out which particular algorithm. There are multiple ways in which you may do regression, for example, which particular one applies or is best suited for the task at hand. Right. 00:09:47:04 - 00:10:15:06 Unknown So efficiency there is AutoML handles it in a single pass, not the normal process requires you to have an iterative do things multiple times. Try it on multiple algorithms or different ways of solving the same problem, and then evaluate which one does it best. AutoML does this in a single pass by. Very smart ways of sampling your data and running quick tests to identify the best approach. 00:10:15:08 - 00:10:35:15 Unknown So that's the efficiency. The second when it does this, why is it so fast? It's so fast because it uses it the full capability of the underlying infrastructure, which is the HeatWave nodes. Right. The number of heat wave nodes you've got the size of these HeatWave nodes. It does these things in parallel and fully utilizes the infrastructure. 00:10:35:17 - 00:11:02:22 Unknown So what is the benefit of that? You can do things a) faster and b) potentially cheaper, which gives you the luxury of trying multiple what if scenarios. Right. It's not a laborious process. It's more efficient. So if you know exactly what you want, you get it done faster. If you want to try multiple scenarios, you can do that faster and at a lower cost. 00:11:03:00 - 00:11:32:23 Unknown So that is the efficiency and the performance enhancements that you get. Awesome. So all right let's switch gears a little bit GenAI is one of the latest additions to HeatWave. What specific GenAI features are currently available or if you can talk about them in development? So, Scott, indeed. GenAI has been one of the latest additions to our platform and frankly it encompasses two separate components. 00:11:33:01 - 00:12:04:05 Unknown One is the customer usage part and the second is the technology part. So from a customer usage perspective, what people want to do is bring in their knowledge bases. And by that I mean bring in the PDF documents, PowerPoint documents, so on and so forth, and ask questions of that or get summaries of that text, or translate that text into another language, or develop a chatbot around it so that they can get answers, things of that nature. 00:12:04:07 - 00:12:36:19 Unknown So what we have done is keeping this in mind for an enterprise setting. We have developed the technology components which are needed to serve these needs, such that going back to the earlier conversation, they focus on their business needs, and we provide them the tools to actually, serve those or we provide the plumbing to do so. So what we have done is to provide a full pipeline to ingest their documents and create the knowledge base. 00:12:36:19 - 00:13:04:06 Unknown And by that I mean bring in your PDF documents, which will get converted into embeddings and stored into vector store. So we provide all of that. And then we provide ways in which to search this knowledge base and give answers to the users via easy to use APIs like retrieval augmented generation (RAG), or doing just semantic search over those documents or doing summarization or translation. 00:13:04:08 - 00:13:32:14 Unknown We also have the ability to, support chat. Now, one very interesting thing that we have done is to provide the users a LLMs, which ran on commodity hardware. Jayant was talking about running this on HeatWave nodes. So we, we, we have provided these LLMs which found on commodity hardware so that people can quickly prototype their application to test it out. 00:13:32:16 - 00:14:00:13 Unknown And if they like the results of the like the performance, they stick with it. Or if they want high performance, and then they go to our OCI GenAI services and use those LLMs. So, quick prototyping, quick testing, quick evaluation done using the commodity LLMs commodity, the LLMs which are running on commodity hardware. And then they can use the OCI GenAI LLMs to get high performance. 00:14:00:15 - 00:14:26:07 Unknown Now going to your question about what newer things are coming in. You know, OCI is at the forefront of this revolution. And they are providing, newer models, newer frameworks and tools. And we are continuously incorporating MySQL and Heatwave with, their tools and technologies so that we can provide the same to our customers, in coming weeks and months. 00:14:26:08 - 00:14:53:23 Unknown Yeah. And then an example would be the agent framework. Right. Integrating with the agent framework, integrating with the hosted frontier models on GPU infrastructure. So you develop your prototype, develop, you can choose to deploy. The integration is preexisting. You don't it's not an after the fact exercise. You use the same infrastructure. And we provide the pre-built integration with those AI services. 00:14:54:01 - 00:15:28:22 Unknown Excellent. So because you are talking about, using, these, LLMs, on commodity hardware, which model, are available from these LLMs and how are we using them? So, we provide as I mentioned earlier, we provide two sets of, access to two sets pof LLMs. One are the smaller parameters LLMs, which are running on, commodity hardware on the same cluster on which HeatWave is run. 00:15:29:00 - 00:16:00:12 Unknown And in that context, we provide models like, mistral 8 billion or, llama 3 billion, 1 billion. And we are continually sort of upgrading these as newer models come into play, since this is a very fast moving fleet. Also, once people have prototyped it, once people have gotten to see the results, if they want to switch over to OCI GenAI LLMs, then we provide access to the wide variety of LLMs that OCI provides. 00:16:00:14 - 00:16:31:11 Unknown Those includes, you know, things like all the llama models, the upcoming models from all the other vendors, all of that is going to be accessible through, all of that is accessible through, MySQL. And these can be used for, as I said, but I do have use cases, be it retrieval augmented generation, chat bots, summarization, translation and many other use cases that, you know, customers continue to think of. 00:16:31:11 - 00:16:57:06 Unknown And we are always amazed at the way they are using, this technology. So when we talked to Matt Quinn, in our first episode, he had mentioned, he briefly touched on, security and privacy issues when it comes to GenAI or when it comes to AI. And we all know that OCI's, biggest concerns are and always have been, security and privacy. 00:16:57:07 - 00:17:26:12 Unknown How is data privacy managed when training models or generating text using GenAI in HeatWave? Thank you for that, Scott. So security and privacy and data privacy are and have always been Oracle's primary concern. Right. It is true for the OCI the infrastructure. And it was built with security in mind. It has been true of Oracle's ever since Oracle since its inception. 00:17:26:14 - 00:18:04:09 Unknown So that is core that is our DNA including for MySQL and Heatwave. So the two primary ways in which this is a benefit here. Number one, the data doesn't move. Number two when it does have to move, as you said, in the case of text generation etc., we don't use only send the relevant snippets. So Sanjay talked about the fact is that you just the documents and then you create embeddings, which essentially that means is you look at various snippets, the model creates a semantic, you know, captures the semantics of it. 00:18:04:09 - 00:18:33:08 Unknown So that you can do a similarity. Right. Do you want to search for, GenAI? You'll also get documents that talk about retrieval augmented generation. Even though you didn't ask about retrieval augmented generation, because the embeddings know that these two things are related. So we extract the relevant content and only that is then sent. For example, if you want to if you're using in database a LLMs nothing is sent anywhere, everything stays in your environment. 00:18:33:09 - 00:18:58:04 Unknown If you're using an OCI service, for example, the relevant pieces sent that you get the answer and OCI is built, as you said, with security in mind. So the relevant piece that you sent is discarded after it has answer your question. So once again, your data doesn't leave the ecosystem. It stays within the Oracle infrastructure. Your infrastructure. Great and secure like we want. 00:18:58:06 - 00:19:29:03 Unknown Exactly. So thank you very much. No. The tricky question that, we have every time we present, anything new or compelling, at conference and stuff, it's that. And I would like to ask you if you could share, with us, a few compelling customer stories or a real world application of, the, ML engineand GenAI in HeatWave, because it's bice theoretically. 00:19:29:03 - 00:19:47:23 Unknown But are people using it? Really? And who are they? If you can say, I know we cannot always share names, but maybe some, are public. So if you can explain us a bit of that would be great. Yeah, Fred, that's a very good question. And indeed, all of this makes sense if customers are using it. 00:19:48:01 - 00:20:31:14 Unknown So, yeah, let me share a couple of them. In fact, let me share two of them, one on GenAI and one on, AutoML, so on GenAI. And this is a public reference. We have a customers, Smarter D their goal, or what they provide is a platform to track the compliance of a company's policy to the guidelines released by various, bodies, for example, DoD. now, in the past, what they have done is to have an auditor, which is going to compare the policy with respect to the guideline and then say if a specific companies are adhering to those guidelines or not, which is kind of 00:20:31:14 - 00:21:04:12 Unknown a, manual process, fairly intensive, right? What they did was to start using, Heatwave GenAI and they stored both the policies and the guidelines in HeatWave GenAI's knowledge based. And then using our RAG using our other capabilities, they compared the policy with the guidance automatically. And the results of these are then given to the auditor. 00:21:04:14 - 00:21:28:15 Unknown So now what does what does this do. This reduces the amount of time the auditor has to spend comparing the two. Instead, he or she gets a, very short description of what has been done, what has not been done, and whereby they can make a judgment whether it is indeed adhering or not. So it makes the auditor far more efficient compared to earlier. 00:21:28:17 - 00:21:57:04 Unknown So that's a that's a use case in the compliance space for HeatWave. GenAI, I, I would give me another example here. I can't, quote the name, but this is in the industrial setting where the customer wants to have early signals of failures of their computer servers. So they are running, large amount of, workloads. 00:21:57:07 - 00:22:39:20 Unknown And these workloads generate logs, and those logs sometimes are indicative of failures. So what they do is they use our log anomaly detection, tool models that, they have built actually, and feed their logs continuously to these models. And these models are trained to detect abnormal logs. So when the model notices abnormal logs, it's going to go ahead and give you an early warning of the fact that a particular system is likely to fail. 00:22:39:22 - 00:23:03:14 Unknown An example of that would be hey, your CPU utilization is trending higher, or your memory utilization is going up, or the number of network connections are going up. All of this the model best efforts based on the data it was trained earlier and then in real time. It looks at the newer data and figures out, you know what, this is looking problematic. 00:23:03:20 - 00:23:29:22 Unknown So let me give you an early warning. And that's what they are doing. So these are two, kind of use cases that some of our customers are two of our customers of, have developed and, use. Great. So just to add to what they were saying on the second case, right, it's high tech manufacturing, their assembly line, the manufacturing process, which is if something goes wrong and that process is interrupted, that's expensive. 00:23:30:00 - 00:23:52:13 Unknown Right. And that is what is being, the benefit here is that they get an early warning and they can mitigate, so they get a chance. They have monitoring systems. Obviously, the monitoring system tells you after the fact something went wrong. Go fix it. The log analytics, the predictive part gives you an early warning so you can mitigate the likely. 00:23:52:15 - 00:24:21:14 Unknown Yeah, you can either prevent or you can reduce the effect of something getting affected on the assembly line. That is really cool. So it's interesting that, Sanjay, you talked about the compliance issue because when we talked to Matt in the last episode, that was actually an example that he used as well, but he was just talking generally not specific, you know, not a specific, customer or, real world, application of it. 00:24:21:14 - 00:24:50:06 Unknown He was just throwing it out there like, this is what you could do. So it's really cool that you brought that up to, to wrap up, what advice would you give to MySQL users who want to start exploring AI with Heatwave? Simple. Get started with the always free instance. So today, and in case y'all aren't aware and folks aren't aware, you can get an always free instance in your own region in OCI. 00:24:50:06 - 00:25:14:15 Unknown If you, you know, sign up on OCI account and create an always free your instance. And that actually has the GenAI and AutoML capability. Now to help you get started, we've got live labs that walk you through scenarios. Right. So we've got 4 or 5 live labs - two on GenAI, two on ML and a few on specific use cases. 00:25:14:15 - 00:25:45:16 Unknown For example, wholesale SailGP uses machine learning with MySQL to help these people perform better in a race. But more importantly, once you we also have a collection of videos. Some of them by you Scott, for example, on what you can do with AutoML, what you can do with GenAI, what you can do with it in healthcare, what can you do with it in e-commerce and etc.. 00:25:45:16 - 00:26:07:00 Unknown So and sample code on GitHub on our GitHub site, sample notebooks, sample SQL code. Try them out on your always free instance. Or if you have access to a beefier instance because you have credits. Try it out on that. And then if you have a specific thing in mind, feel free to reach out to any one of us. 00:26:07:00 - 00:26:25:06 Unknown If you want help to do a prototype or do a POC, we'd be more than happy to do that. Nice. Thank you very much. So it was a pleasure to have you both here, Sanjay and Jayant. So thank you very much. To participate to this podcast. So thank you and bye bye. Bye bye. Thank you. Bye bye. Thank you. 00:26:25:06 - 00:26:53:23 Unknown Guys, that's a wrap on this episode of MySQL: Sakila Speaks. Thanks for hanging out with us. If you enjoyed listening, please click subscribe to get all the latest episodes. We would also love your reviews and ratings on your podcast app. Be sure to join us for the next episode of MySQL: Sakila Speaks.

7 de ago de 2025 - 26 min

AI for the Rest of Us: A High-Level Overview

Kick off Season 3 of Inside MySQL: Sakila Speaks as leFred and Scott welcome Matt Quinn for an engaging introduction to the world of Artificial Intelligence. In this episode, we step back from the database and explore what AI really is, how it's shaping society and technology, and why it matters to anyone in tech today. Whether you're just curious about AI or eager to understand its key concepts, join us as we break down the basics and set the stage for a season of discovery. ------------------------------------------------------------ Episode Transcript: 00:00:00:00 - 00:00:31:22 Welcome to Inside MySQL: Sakila Speaks. A podcast dedicated to all things MySQL. We bring you the latest news from the MySQL team, MySQL project updates and insightful interviews with members of the MySQL community. Sit back and enjoy as your hosts bring you the latest updates on your favorite open source database. Let's get started! 00:00:32:00 - 00:00:58:22 Hello and welcome to Sakila Speaks, the podcast dedicated to MySQL. I am leFred and I'm Scott Stroz. Join us today. It's Matt Quinn, vice president and head of AI at Orracle. Matt leads how Oracle Cloud Infrastructure's AI services are adopted by customers in EMEA. Matt brings deep expertise in enterprise software strategy and a passion for making AI both powerful and its adoption practical. 00:00:59:00 - 00:01:21:03 Today he is here to help us unpack what GenAI really means for the organizations we work for and buy from, and what it means for developers, data professionals, and MySQL users everywhere. Matt, welcome to Inside MySQL: Sakila Speaks. It's great to have you with us to kick off season three of our podcast. Thank you very much, Fred, Scott, great to be with you. 00:01:21:08 - 00:01:43:21 Looking forward to, to an interesting conversation and getting us going for season three. Awesome. Matt, thanks for being here with us. So right off the bat, when most people hear the term AI, they probably think of chat bots. But that's just one form of AI. Can you help provide us with like a high overview of the different types of AI that exist? 00:01:43:23 - 00:02:15:10 Absolutely. And I think AI and itself is a broad church, right? There's a number of different, kinds of AI. The term actually dates back to the 1950s as a concept for you know, machine thinking. It's had a couple of false dawns over the time when compute and data to train. I wasn't really quite ready for this, but as we got into the 90s and the early noughties, as compute power grew, as storage grew, a confluence of internet accessibility, lots of data becoming available, and then we time fed forward. 00:02:15:12 - 00:02:33:12 We found that organizations could do the fundamentals of what we know of AI today things like machine learning. So learning a trend and a pattern, looking at what happened in the past and do a statistical regression on that to predict some future outcome based on what happened in the past. And we use examples of this today without even knowing it. 00:02:33:12 - 00:02:52:11 You know, is this email that's coming into my email system, is this spam or not spam? Those kind to classifier types of AI have been prevalent for the last ten, 15, 20 years, and we're moving forward to where AI has this more kind of human interaction. It's surfacing and it's suddenly popped into the zeitgeist, for for conversation. 00:02:52:15 - 00:03:14:03 So it has multiple facets. We have machine learning trained something to do, something very specific, show it, something that it's seen before and enable it to predict the future based on what it's learned. But we're starting to see this wave of generative AI do more advanced, more nuanced, more humanlike things, and I think that's a really powerful kind of inflection point that we've seen in the last two, three years. 00:03:14:05 - 00:03:39:02 Thank you. So because in your first, answer, you said you said about the 70s and 90s, but why is I having such a huge moment right now? So what changed since that time? I think that the real inflection point is the the kind of conversational nature of it. You can speak human to it, and it can speak human back to you. 00:03:39:04 - 00:04:01:13 If I think about how compute evolved, you know, it used to be I had to type cryptic commands on the green screen in order to be able to use a computer, which meant the audience of people who could use computer to do something was very limited. In the 80s is the GUI. The graphical user interface kind of emerged suddenly it was a keyboard in a mouse, and the population of people who could interact with the computer was much broader. 00:04:01:15 - 00:04:19:02 Mobile did the same for us, but you still had to learn things. You had to take the human to interact in a way that made sense to the computer. With generative AI, I think what's happened in the last 2 or 3 years is actually the computer is coming to meet the human. Suddenly it's able to interact with us in our language. 00:04:19:07 - 00:04:37:19 I can have a conversation with it. I can ask a question in natural language. Now I might need to engineer my prompts to get the right kind of outcomes to guide it. Actually, the computer understands what I say. It can meet my language and understand that interact with me in a very human way. And I think that's caught the imagination of people. 00:04:37:19 - 00:04:59:18 They've suddenly had this 'aha' moment and that then has gone from, you know, an academic or data or IT kind of problem. It's broken out of it and gone into the board to say, well, actually, what does this mean? How will this work? And as people start to imagine what it could do beyond, you know, asking a question about, you know, what recipe do I have? 00:04:59:18 - 00:05:20:13 Or how can I find an answer to a question I could historic could use a search engine for, but save me some heavy lifting organizations to look at it and say, oh, hang on a minute. What manual processes in my organization...What low value repetitive tasks are happening in my organization that this might help me change? So suddenly AI has gone from being an IT conversation to being a business conversation. 00:05:20:13 - 00:05:48:15 It's it's got the opportunity. It's got the ear of the board. And suddenly that's just pivoted the demand and the interest in AI I think in the last couple of years. That is quite insightful. So because I has become the big thing in the world and everybody is talking about AI, there's got to be some, some common myths or misconceptions about AI out there that you've heard give us one or a couple that you've you've heard that you need to clear up and be like, that's not actually the case. 00:05:48:17 - 00:06:11:19 So there's a couple of things that I think, reoccurring in the conversations I have with customers, with, with engineers, with particularly people outside of IT. And one of those is around privacy. And I think that the challenge that we have with AI is the first services that really burst this into the public domain. There's kind of ChatGPT services. 00:06:11:19 - 00:06:31:04 There's first, opportunity where you could just go to a website, sign up for free, try something for free, engage with it and have a human like conversation. But that spread like wildfire, like 100 million users in a crazy amount of time. The interesting thing there is that free service, and I always like the phrase if something's free, you are the product. 00:06:31:06 - 00:06:54:21 That's those kinds of public sites where it's, you know, it's a consumer-grade service. There's no charge. The huge costs sitting underneath those models, like running the infrastructure, running the applications, having train the models. So the reality is in that environment, the value exchange it was happening is the prompts that I give that free service are available to be used to retrain the model to extend it, to make the product better. 00:06:54:23 - 00:07:24:01 So you're giving access to the data that you provide through a prompt to the service provider that is running that service. That's the value exchange. Now that's created this perception in people's minds that AI isn't private or safe or secure. And I think the reality is, when you do this in an enterprise context, you can absolutely run those models in a ring fenced way, the same way you'd run a database platform where it's isolated. 00:07:24:07 - 00:07:41:09 It's not sending data back to the model provider, it's secure and it's yours. And that enables you to do things. Bring your private data combined with the intelligence that the model has been trained on with public data. And that's what builds builds a system. But it doesn't have to be a system where you're losing control of that data. 00:07:41:13 - 00:08:02:22 So I think there's a lot of FUD around that fear, uncertainty and doubt. And it's up to us as technologists to help dispel the myths and separate where that might be happening in certain domains. That free service is public services. Maybe that is happening, but in an enterprise it scenario, you absolutely can put the security and privacy guardrails around it to meet the kind of enterprise controls that you'd expect. 00:08:03:00 - 00:08:36:10 Whilst reaping the benefits of the AI productivity gains, that you could have. So I think that that to me is the big one. Awesome. Thank you. So, because you said that, you you talked about AI, in industries, and how it's used. And I really like the analogy with the database. So for us, with MySQL, we really enjoy, the databases, could you, paint a picture of how AI is being used across the industries, or is it just specific, or can we use it, in different ways? 00:08:36:10 - 00:08:58:07 And, now it's a great question. I think, like most technological innovations, the thing that is most disruptive about AI is it has an opportunity to be a general purpose technology. And so if I think about things like the internet and electricity before it, electricity is a general purpose technology, right? It's one thing that it is it's ubiquitous in society. 00:08:58:09 - 00:09:21:11 But it's used for many things. Right? It's used for the lights in my room, for the microphone, the router that's routing this, this conversation to you. It's also used to heat my house. It's it's used to generate, to run factories. It's a general purpose technology. The beauty of that is it's power and it's ubiquity and it's up is only constrained by the imagination of people who take electricity and think about what problems could I solve with it? 00:09:21:13 - 00:09:46:16 I think I will be very similar to that. But it's up to us in industry, in technology to invent ways to use this, that are productive, that deliver value for our organizations or for society at large. And the real opportunity there is is boundless, is captured only by our imagination. What I am seeing is there at the very specific first mover type, use cases that are happening. 00:09:46:20 - 00:10:08:10 And they might be things like, you know, drug discovery and protein folding, like highly academic, data science led things. They're moving really fast because those are things where data scientists were already doing lots of work, they were already doing machine learning. They were already up and running with AI. What I'm noticing is that enterprise adoption is a different kind of material, right? 00:10:08:10 - 00:10:30:00 It's a different kind of IT problem to go solve. So what we're seeing is enterprises are experimenting. They're doing lots of pilots. They're they're kind of engaging in, you know, the art of the possible. How could we use this in our organization? What things do we not know about how to do this? We haven't trained our organizational muscle to be able to go from idea to pilot to production yet. 00:10:30:02 - 00:10:51:11 So what I'm seeing is organizations look at human in the loop scenarios. So they're starting with applications where AI is helping a task that already happens to happen a bit more efficiently, a bit more effectively, or drive more coverage. And my favorite example of this was, is in regulated industries where actually, you know, organizations are a bit fearful of upsetting the regulator. 00:10:51:11 - 00:11:10:22 And, you know, we're using AI. And what's the governance challenge with this? I work with a few organizations. You've actually turned that on his head. And what they've said is, how can we use AI to improve our compliance, and regulatory frameworks. So they were looking at this and saying, well, you know, today we have a contact center and we have a team that listen to all the recordings. 00:11:10:22 - 00:11:30:19 Well, actually, they listen to 5%. They sample the recordings and they look for compliance challenges. And then they use that to inform how they educate people and report with compliance, status. So they said, well, actually, why don't we have I listen to all of the calls and then the team that were previously only listening to 5% can go and mark the AI's homework. 00:11:30:21 - 00:11:47:08 And this creates value because now I've improved my compliance perspective on screening all of my phone calls. And the people who are listening to those calls and marking the AI's homework, they can improve and iterate on the model and make it better over time. So we have that human in the loop. So it's augmenting the capability of a team to do something and improving the outcomes for the organization. 00:11:47:14 - 00:12:09:11 I think when you start with use cases that are in that kind of domain, the organization can learn, can adapt and then understand how do I apply this to other problems. And it really has to come from what's the biggest problem in my organization? What's my strategic objective? How does that relate to a data strategy, to an AI strategy to go solve those business problems I want to solve? 00:12:09:14 - 00:12:31:19 And that's the real connective tissue here. It's not a science experiment. It's not AI for the sake of it, just like it wasn't data for the sake of it. It's about data to solve a business problem, help us take action in our organizations. That's awesome. So the three of us obviously work for Oracle, and there's been a lot of news about what Oracle wants to do in terms of AI. 00:12:31:19 - 00:12:58:02 And, you know, are we currently a significant player in the AI world or are we going to get there eventually, do you think or, you know, is it is there is there some other path for Oracle in terms of AI? I think Oracle has a unique position in a number of ways. So if we think about the news that you're talking about yeah, there's lots in the press today about the huge investments that we're making, the giant partnerships that we're doing. 00:12:58:04 - 00:13:21:13 These are about the industrial scale infrastructure that will be needed by organizations, both to train the next generation of these models, but equally to run and inference them. So if you're an organization that wants to consume AI, you want to do that scale. You need that bulletproof, high performance, low latency infrastructure that is secure and robust in order to run the workloads that are powered by AI. 00:13:21:15 - 00:13:43:20 If you're going to do this in an enterprise fashion, you're going to want to do that in a robust, secure, resilient fashion. So building out that infrastructure, the Oracle Cloud infrastructure that we have today, the strong partnerships we have with, GPU providers and software vendors like Nvidia, these are the kind of raw foundational capabilities at absolutely epic scale that are critical to this success 00:13:43:21 - 00:14:21:20 in AI and Oracle's right at the front of that. Interestingly, though, it's not just about tin in data centers. It's about the software stack. It's about the ability to take that raw compute and augment it securely with robust data practices, bring data into the world, bring AI to where that data lives today. That's where I see Oracle being really powerful between huge database platforms from Oracle to relational database platform to MySQL, these are key capabilities and your key software assets that will help organizations unlock the power of that infrastructure and bring it to life in their organization. 00:14:22:00 - 00:14:53:00 And then at the other end of the spectrum, you have, SAS applications at fusion. These are the business process tools, the systems of record, the organizations trust to do work for their organization. They have key elements of data, and they operate they run business processes in your organization. So the ability to surface the outputs of the AI and applications that business users use so they can understand it, use the, you know, interact with the data, glean insights from it, leverage the power of AI to take action for and with them. 00:14:53:02 - 00:15:13:21 That that combination right across the stack, I think is where Oracle is uniquely positioned. And and hence,I am here. Excellent. Yeah. Very nice to see that Oracle actually is a big player in the AI and had the opportunity to see plenty of, of stuff on that tool like the data centers, who we created the, for it. 00:15:14:03 - 00:15:44:00 And, so and yeah, in MySQL, we, we already also see with MySQL HeatWave what brings to to AI there. But, so with that position of of Oracle, going on lot on AI, do you think it will impact, the, the product portfolio, of Oracle, like some stuff to, like, MySQL we know about it, but for other products, do you think that will also impact them? 00:15:44:02 - 00:16:08:19 This, this role of Oracle in the industry? I think it will. I think it will it will bring a new gravity to, to the solutions that we offer. I think the other component, and you're seeing it with MySQL, you see it with Oracle, you know, actually, how do we take the greatness of the database platforms that we have and extend that to simplify organizations use of new technologies? 00:16:08:23 - 00:16:31:13 And you know, my favorite example of this is how do you enable the existing database to do more of the tasks you need in an AI world? So with that, I'm thinking about vectorization, storage of vectors, the ability to run inferencing close to the data. I don't have to pull all my data out of the database just to then run some inferencing over it. 00:16:31:18 - 00:16:58:15 How do I bring that AI capability directly to where the data lives? So I think we're seeing that with lots of the product innovations. And we're also thinking about like what does this mean to governess. You know, if you have a solution where, you know, you've become used to as an organization governing and managing a relational database, how do I then work in a world where I have unstructured data, structured data I have now vectors, it's these are all living in different store data stores. 00:16:58:17 - 00:17:13:12 How do I govern and control that? How do I make sure that I'm keeping that data in in sync? How do I make sure that I've got my GDPR compliance correct? A customer wants to be forgotten. I've now got more places that I need to forget. The customer. I, you know, update that data because it has to be correct. 00:17:13:14 - 00:17:42:02 So I think this concept and we see it across the MySQL platform, we see it across Oracle database. Actually by bringing the vectorization, the vector storage, the vector generation, the the ability to query right into the database engine, you simplify the operational management, you simplify the governance of that model. It makes it easier to secure, to manage access in ways that your organization is already familiar with, by managing a MySQL estate or by managing an Oracle platform. 00:17:42:06 - 00:18:04:09 So so suddenly you're able to expand the scope of the things that you do without it bringing extra operational and governance, complexity into your organization. So it's already influencing our product portfolio. It's already changing the way that we expand to help organizations take advantage of these new needs, these new demands and services, but bring those in a way that makes them part of the existing ecosystem. 00:18:04:09 - 00:18:22:15 They're using the Oracle. And of course, that will continue to evolve in ways that, you know, if I had a crystal ball, I would I'd be looking at what those might look like. But, you know, the key here is that we're moving early, we're moving fast, and we're learning from those, demands and evolving products to help organizations gain the value of that. 00:18:22:15 - 00:18:49:14 They don't have to invent all of these capabilities themselves. They can consume them baked into the products they already used. So I know from the MySQL side that we have customers who have terabytes or petabytes of data. What role is is that data going to play in building or benefiting from AI? And again, I'm talking particularly about like structured data that would be in a MySQL database. 00:18:49:16 - 00:19:12:13 Got it. So so if I think about that, that kind of structured data often that's going to be data that represents entities or processes in your organization. Right. It is the state of a process or of, of an entity, a customer, an order fulfillment, something that exists in the real world projected into a piece of data in that database. 00:19:12:15 - 00:19:54:03 And if we want AI to be a part of how business gets things done, runs a business process, it's going to need to have secure, robust access to well-trusted, grounded data that represents the real world. And I think that key is where AI, in the large language model, the kind of ChatGPT I can interact with it, I can have a conversation with a process that's that's trained and kind of sealed in its data set that it was trained on, but it brings in intelligence that helps it understand a question, interpret language to, perhaps reason over some of the, the, the assets that you've given it as part of that prompt where it becomes 00:19:54:03 - 00:20:16:23 really powerful is in the process is this commonly been referred to as RAG or resource augmented generation. This is the ability to take your private data and securely add it effectively to the prompt. So you add lots of context to the question that you asked the model. Now I can use its intelligence and the ability to understand based on the public data it was trained on. 00:20:17:01 - 00:20:37:21 And in response to the question that you've asked, it can also now answer that ground in your in your own data. So if that data is the structured data, you know, it's about an order, it's about a product description, it's about a fulfillment or an employee. Then suddenly you have the ability to look at that private data set and reason over it using the intelligence from the large language model. 00:20:37:23 - 00:21:04:08 So data will be fundamental because data represents the real world. Data represents the things that we want our business to do. So if you can bring that data, enable it to be composed with that large language model, with the AI, then the AI suddenly can do things in our organization. It can provide insights into our organization. Or if we think more about agenetic AI, it can start to take action, force or recommend actions enable things for us to be done on our behalf. 00:21:04:10 - 00:21:27:17 I think that's where we start to see the flywheel really turn structured data that represents business processes powered by large language models and simplifying the way that that kind of ecosystem, combines. That's where we'll unlock real business value. Enterprise value, versus helping me my homework. That's great. So thank you very much, for that information. It's very insightful. 00:21:27:19 - 00:22:04:15 So yeah, I'm very happy to, to, to, to start this, this third season, with you, Matt, about, AI and we will see in the future, episodes, also in the next one, everything related to and more in with MySQL, of course. But, I think it's very, very interesting time, for people to test AI and, for the people who will listen to us that they can play, on OCI with HeatWave there is a free, HeatWave that has also, AI capabilities. 00:22:04:18 - 00:22:27:11 There is also the, OCI, GenAI service that can be, useful to play with. I, play with the, with both of them. And it's very, very, very interesting and, surprising. Oh. It works. I don't know if you have something else to add for us, but, we were I was very happy to to to chat with you. 00:22:27:13 - 00:22:48:16 But I I'll challenge one thing, Fred, that you said, and I'll be slightly cheeky on it, but playing with it and experimenting, it is step one to learning what can be done, but will only really learn how to do this when we start to practically apply it to real world problems. So we need to move from this kind of experimentation and pilot phase. 00:22:48:20 - 00:23:05:16 That has to happen. And as individuals, as technologists, we will have to do that to learn and get to grips with this technology. But we do need to find ways as organizations to actually do this in anger. And I think, you know, I always use the phrase if you if you want to run a marathon, you start by getting up and running the five K, right. 00:23:05:20 - 00:23:23:02 And it's you do a real run and it hurts. It hurts like hell when when you do that first one. But it becomes easier as you do more of them and you start to expand scope. You start to get longer. You can do bigger runs. Organizations need to do that same piece and train the organizational muscle. You'll do it with real world projects. 00:23:23:04 - 00:23:42:13 We absolutely need to learn how to do this and experiment and learn. But the best way that an organization can, can learn to do this quickly is to find a real world problem to solve and work back from. Why does this organization need to use AI to do this? What problem can it solve for us? And then think about how can AI help us do it? 00:23:42:16 - 00:24:10:08 I think we can get that flywheel going. The playing with it will inspire us. But that's not the end game, right? That's just this chapter one. Yeah. The playing around I think, feeds the, the, the need for lack of a better word that, you know, a lot of times, like I come from a developer background and a lot of times, customers or clients didn't necessarily ask us for what they wanted. 00:24:10:11 - 00:24:36:05 They asked us for what they thought we could deliver. So the Henry Cole thing. Right. I want a faster horse. Exactly. So instead of, you know, saying, hey, this is the problem we have, they're like, well, this is how we think you can solve it. And I think AI is kind of the same thing where people don't really know what the capabilities are, or they're asking for capabilities that they think are the limits of the AI rather than the capabilities that they actually want. 00:24:36:07 - 00:24:52:02 And I think we're going to get to a point probably, sooner rather than later, that we're going to realize that AI can help us with a lot more stuff than what we think it can do right now. Definitely. And it will it will meet in the middle. Right. The business will be saying, I've got these problems I want to solve. 00:24:52:02 - 00:25:17:13 And I think AI, as part of the solution, and developers and technologists who've taken the time and invested the energy to go learn the technology, to see the are the possible, they can then be inspired by the kinds of things that happen when those two meet in the middle. That's where we'll see a real innovation coming in organizations doing really clever things and taking the great products and services that we've built on Oracle Cloud infrastructure in MySQL, in HeatWave, in Oracle database. 00:25:17:14 - 00:25:36:04 The main aim of those is to make it simpler for organizations to take those ideas very quickly, pilot them and prove value. But it's not about piloting them in isolation. We need no cliffs, right? We need to get to the point where when that pilot's ready, we can securely robustly, deliver that into production and we can scale it. 00:25:36:09 - 00:25:56:19 I think doing these, experiments in these enterprise scale, frameworks, in the tools that we provide that gives organizations a route from pilot to production. And that's the bit that I think organizations are really craving. And it's a we we're about to see a real inflection point on that fantastic. Matt, again, thank you for joining us. 00:25:56:21 - 00:26:10:14 I think this has been a great conversation, and I really think that our listeners are going to get a lot out of it, and hopefully it whets their appetite to learn more about AI in upcoming episodes. Thank you very much for having me. Great, great to talk with you. I look forward to listening to all of their story. 00:26:10:16 - 00:26:30:06 Thank you, thank you. That's a wrap on this episode of Inside MySQL: Sakila Speaks. Thanks for hanging out with us. If you enjoyed listening, please click subscribe to get all the latest episodes. We would also love your reviews and ratings on your podcast app. Be sure to join us for the next episode of Inside MySQL: Sakila Speaks.

25 de jul de 2025 - 26 min