Tech Beats Unplugged

Ep05: "Deploy Local LLMs in the Cloud (100% Data Privacy)"

11 min · Sept. 24, 2024

Description

👨🏽‍🚀 Welcome to Episode 05 of "Tech Beats Unplugged". This time, we tried something completely crazy: we're letting the AI hosts take over! That's right 😎. We're flipping the script and giving the AI the mic to guide us through the fascinating world of local LLMs. But that's not all: this episode is inspired by my recent talk at Oracle CloudWorld in Vegas. The topic? You guessed it: local LLMs in the cloud. 🌟 We're so excited to share our latest Tech Beats show with you 🧡! We hope you'll enjoy it!

Topics discussed:
1. (00:00) Introduction
2. (01:00) Why OpenAI Might Not Be Your BFF
3. (02:40) Local/Open LLMs to the Rescue!
4. (03:38) What's Quantization?
5. (04:30) Where to Find These Open LLMs
6. (05:02) Inference Engines (Ollama)
7. (05:50) What's a Modelfile?
8. (06:40) What About Deploying Local AI to the Cloud? (OKE/managed Kubernetes)
9. (07:30) From Zero to Cloud Deployment Hero
10. (08:28) What's Next (LLM ethics benchmark)
11. (09:55) Outro

Show Notes
* My local LLM Git repo: Ollama_lab [https://github.com/brokedba/ollama_lab]
* HELM leaderboard for model safety: Stanford HELM model leaderboard [https://crfm.stanford.edu/helm/classic/latest/#/leaderboard]
* My talks at Oracle CloudWorld 2024: OCW2024LLM [https://bit.ly/OCW2024LLM]

All episodes

10 episodes

The Inglorious Cloud Traps ☁️ | Ep 09

👨🏽‍🚀 Welcome to Episode 09 of "Tech Beats Unplugged": "Inglorious Cloud Traps ☁️" with Mark Boost [https://www.linkedin.com/in/markboost/], CEO at Civo [https://www.linkedin.com/company/civocloud/] Cloud. Is the cloud broken ...or just unfair? This time, Mark Boost [https://www.linkedin.com/in/markboost/] and Kosseila H. go deep into the infamous cloud traps and explore where control is lost, and how to regain it for customers. 🌟 We're so excited to share our latest Tech Beats show with you!

Topics discussed:
1. (00:23) Introduction
2. (01:42) From Formula 3 to serial entrepreneur
3. (08:44) Lessons from scaling multiple companies (pivoting, competition, timing)
4. (18:59) Civo elevator pitch
5. (21:50) Why the cloud, as we know it, is broken
6. (28:59) The infamous cloud traps: egress fees, exit tax
7. (37:50) Startup free credits: what's the catch?
8. (44:02) Cloud regulation & fairness: blessing or curse?
9. (47:32) Is regulation killing AI innovation in Europe?
10. (51:31) David vs Goliath: independent providers vs the Big 3 hyperscalers ("the moat")
11. (56:56) AI energy crisis: how do we power AI without torching the grid 👨🏻‍🔧🏭?
12. (01:01) ESG vs greenwashing?
13. (01:03) Outage chaos: CEO mistakes, and how to do it right (CrowdStrike case)

Show Notes
* Civo website: civo.com [civo.com]
* Egress fees: Big Tech Braces For Digital Market Act [https://www.adexchanger.com/daily-news-roundup/tuesday-23012024/]
* Civo talk "The cloud is broken": The Cloud is Broken [https://www.youtube.com/watch?v=x1Oo3T9Mphw]
* Digital Markets Act [https://ec.europa.eu/commission/presscorner/detail/en/ip_24_4761]
* European Data Act [https://digital-strategy.ec.europa.eu/en/policies/data-act]
* Free credits: A Call for Fair Competition by Addressing the Impact of Free Cloud Credits [https://www.linkedin.com/pulse/call-fair-competition-simon-hansford-0ogbe/?trackingId=VDIuXnFTRmuIogz03rWqfg%3D%3D]
* Green policies (ESG): ESG reporting: Carbon in the cloud [https://www.cio.com/article/2517653/esg-reporting-carbon-in-the-cloud.html]
* Data center CO2 footprint calculator [https://8billiontrees.com/carbon-offsets-credits/carbon-ecological-footprint-calculators/carbon-footprint-of-data-centers/]

🎙 About Mark Boost:
* Website: https://markboost.com/
* LinkedIn: markboost [https://www.linkedin.com/in/markboost]
* X (Twitter): @markboost
* 10 tips for a successful career [markboost.com/2024/08/22/10-tips-for-a-successful-career/]

Feb. 17, 2026 · 1 h 11 min

Nutanix in the Age of the Virtualization Shake-Up (w/ Michael Webster) | Ep08

👨🏽‍🚀 Welcome to Episode 08 of "Tech Beats Unplugged" 🦘 This episode cuts through the VMware Broadcom chaos and breaks down where Nutanix really fits in today's virtualized/multicloud world, with the help of none other than Nutanix guru Michael Webster, all the way from New Zealand. 🎧 We're so excited to share our latest Tech Beats show with you 🧡! Please share away 🤗 We hope you'll enjoy it!

Topics discussed:
1. (00:21) Introduction
2. (01:58) What's your story with Nutanix?
3. (09:41) Hyperconverged infrastructure: what's that?
4. (13:38) Understanding Nutanix
5. (18:25) The VMware-Broadcom chaos for customers
6. (26:00) Running VMware on Nutanix (and moving off)
7. (30:05) Nutanix hardware ... a Lego of nodes
8. (36:20) Nutanix building blocks: what's a Controller VM?
9. (43:05) Nutanix kills the legacy RAID storage - how?
10. (52:04) Nutanix: from hardware to software company
11. (57:00) What is Nutanix's moat going forward?

Show Notes
* VMware customers face up to 1,500% price increase: News [https://www.networkworld.com/article/3994107/vmware-customers-in-europe-face-up-to-1500-price-increases-under-broadcom-ownership.html]
* Nutanix, how it works: Video [https://www.youtube.com/watch?v=IKdzcwMY950]
* Dutch court forces Broadcom to support VMware migration after 85% price hike: Networkworld [https://www.networkworld.com/article/4015489/dutch-court-forces-broadcom-to-support-vmware-migration-after-85-price-hike-backlash.html]
* Nutanix Bible (nutanixbible.com [https://nutanixbible.com/])
* AT&T sues Broadcom over licensing: Forbes [https://www.forbes.com/sites/stevemcdowell/2024/09/06/why-atts-suing-broadcom-over-forced-vmware-license-changes/]
* Nutanix AHV: Built-in Enterprise-Class Virtualization [https://youtu.be/80UiJp0i7K0]

🎙 About Michael Webster:
* Website: longwhiteclouds.com [https://longwhiteclouds.com]
* LinkedIn: Steve Giguere [https://www.linkedin.com/in/stevegiguere/]
* X (Twitter): @vcdxnz001 [https://x.com/vcdxnz001]

Brought to you by Cloudthrill [https://cloudthrill.ca].

Dec. 23, 2025 · 1 h 3 min

🔴 TechBeats Live: LLM Quantization "vLLM vs. Llama.cpp" | Ep07

👋🏼 Hey AI heads 🎙️ Join us for the very first Tech Beats Live 🔴, hosted by Kosseila, aka @CloudDude from @CloudThrill. 🎯 This chill, laid-back livestream will unpack LLM quantization 🔥:
* ✅ WHY it matters
* ✅ HOW it works
* ✅ Enterprise (vLLM) vs Consumer (@Ollama) trade-offs
* ✅ and WHERE it's going next.

We'll be joined by two incredible guest stars to talk enterprise vs consumer quants 🗣️:
🔷 Eldar Kurtić, bringing the enterprise perspective with vLLM.
🔷 Colin Kealty, aka Bartowski, creator of the top-downloaded GGUF quantized LLMs on Hugging Face.

🫵🏼 Come learn and have some fun 😎.

Chapters:
(00:00) Host Introduction
(04:07) Eldar Intro
(07:33) Bartowski Intro
(13:04) What's Quantization?
(16:19) Why LLM Quantization Matters
(20:39) Training vs Inference - "The New Deal"
(27:46) Biggest Misconception About Quantization
(33:22) Enterprise Quantization in Production (vLLM)
(48:48) Consumer LLMs & Quantization (Ollama, llama.cpp, GGUF) - "LLMs for the People"
(01:06:45) BitNet 1-Bit Quantization from Microsoft
(01:28:14) How Long It Takes to Quantize a Model (Llama-3 70B) - GGUF or llm-compressor
(01:34:23) What Is I-Matrix & Why People Confuse It with IQ Quantization
(01:39:36) What's LoRA & LoRA-Q?
(01:42:36) What Is Sparsity?
(01:47:42) What Is Distillation?
(01:52:34) Extreme Quantization (Unsloth) of Big Models (DeepSeek) at 2 Bits, 70% Size Cut
(01:57:27) Will Future Models (Llama-5) Be Trained on FP4 Tensor Cores?
(02:02:15) The Future of LLMs on Edge Devices (Google AI Edge)
(02:08:00) How to Evaluate the Quality of a Quantized Model
(02:26:09) Hugging Face's Role in the World of LLM/Quantization
(02:33:46) Hugging Face's Role in the World of LLM/Quantization
(02:36:41) LocalLlama Sub-Reddit Down (Moderator Goes Bananas)
(02:40:11) Guests' Hope for the Future of LLMs & AI in General

📖 Check out the quantization blog: https://bitly/LLMQuant [https://cloudthrill.ca/llm-quantization-all-you-need-to-know]

#AI #LLM #Quantization #TechBeatsLive #LocalLlama #vLLM #Ollama

July 19, 2025 · 2 h 51 min

Ep06: "GitHub Security Horror Stories" (with Steve Giguere)

👨🏽‍🚀 Welcome to Episode 06 of "Tech Beats Unplugged". This time, we're diving headfirst into the craziest GitHub security stories, and who better to join us than Steve Giguere, an industry veteran and security expert who's seen it all. From supply chain security mayhem to GitHub Actions gone wrong, we uncover real-world security blunders, attack vectors, and best practices to keep your repos and workflows safe. 🌟 We're so excited to share our latest Tech Beats show with you 🧡! Please share away 🤗 We hope you'll enjoy it!

Topics discussed:
1. (00:00) Introduction
2. (03:53) Software supply chain security acronyms (SAST, DAST, IAST, etc.)
3. (09:15) "A workflow is an application within your application" - What does that mean?
4. (12:16) Public vs. private repos - Are private orgs still at risk?
5. (18:27) Self-hosted runners: safe or security nightmare?
6. (21:16) GitHub environment variables - How critical are they?
7. (22:55) Secrets, masks, and how secure they really are
8. (28:05) Artifact vs. caching: which is safer?
9. (31:27) Craziest GitHub security screw-ups Steve has ever seen 🔥
10. (36:42) Common attack vectors in GitHub Actions
11. (44:19) Best security practices for GitHub Actions - Low-hanging fruit fixes
12. (50:22) Are public actions safe? Can they be scanned?
13. (53:52) xz backdoor fiasco - Lessons from the latest supply chain attack
14. (59:00) NVD's slowdown - What's at stake?

Show Notes
* CI/CD Goat (deliberately vulnerable CI/CD environment): GitHub [https://github.com/cider-security-research/cicd-goat]
* GitHub cache poisoning: Cacheract Attack [https://adnanthekhan.com/2024/12/21/cacheract-the-monster-in-your-build-cache/] | ScribeSecurity [https://scribesecurity.com/blog/github-cache-poisoning/]
* Your GitHub Secrets in Plain Text: CloudThrill [https://cloudthrill.ca/your-github-secrets-aint-that-secret]
* Ghat tool (updating dependencies in GitHub Actions): GitHub [https://github.com/JamesWoolfenden/ghat]
* OpenSSF Scorecard: Website [https://scorecard.dev/]
* The GitHub Worm (Asi Greenholts): Palo Alto Blog [https://www.paloaltonetworks.com/blog/prisma-cloud/github-actions-worm-dependencies/]
* OWASP Top 10 CI/CD Risks: OWASP [https://owasp.org/www-project-top-10-ci-cd-security-risks/]
* Heartbleed OpenSSL Exploit: Wikipedia [https://en.wikipedia.org/wiki/Heartbleed]

🎙 About Steve Giguere:
* Website: stevegiguere.com [https://stevegiguere.com]
* LinkedIn: Steve Giguere [https://www.linkedin.com/in/stevegiguere/]
* Book: Cloud Native Application Protection Platforms, O'Reilly [https://www.oreilly.com/library/view/cloud-native-application/9781098141691/]
* Personal Blog: Codifyre [https://codifyre.com/]
* Talk "Lessons Learned from OSS and GitOps Journey": YouTube [https://youtu.be/xH_wzHwwQho]
* OWASP Lisbon Talk: YouTube [https://youtu.be/-WxtnUHhrlc?feature=shared]
* StayWiredIn YouTube Show: StayWiredIn [https://www.youtube.com/@staywiredin]
* DevSecOps Podcast: Spotify [https://open.spotify.com/show/0XVk0AKg26yLTCMMwkIA7m]

June 10, 2025 · 1 h 5 min

Ep05: "Deploy Local LLMs in the Cloud (100% Data Privacy)"

Sept. 24, 2024 · 11 min