Winners' Circle
Siobhan Hanna is helping AI companies protect one of the most important parts of model development: the human data layer. As a leader at WeLo Data, she works with foundational LLM builders and enterprise technology companies to provide high quality multilingual human data across languages, cultures, and markets. WeLo Data’s NEMO framework recently won an AI Excellence Award for helping detect fraud, misrepresentation, and data integrity risks in AI training pipelines. In this episode, Russ and Siobhan explore why high quality human data is essential to building better AI models, and why that data is increasingly vulnerable to fraud. Siobhan explains how contractor based, globally distributed AI data workflows can create opportunities for identity fraud, coordinated manipulation, account sharing, and other risks that can degrade model performance. They dive into NEMO, WeLo Data’s fraud mitigation and misrepresentation detection framework. Siobhan shares how the system uses continuous monitoring, behavioral analytics, rules based logic, AI driven detection, and organizational psychology to identify suspicious activity across the contributor life cycle. The conversation also covers why AI data integrity should be treated as part of the broader data quality and governance conversation. Siobhan explains why point in time checks are not enough, how WeLo Data borrowed ideas from financial services and KYC models, and why continuous monitoring is critical when training data is so strategically valuable. Along the way, Siobhan discusses multilingual AI, cultural context, data provenance, contributor verification, regulatory trends, and why protecting the human layer of AI development may soon move from best practice to formal requirement. Topics Covered: [00:01] Welcome and intro, Siobhan Hanna and WeLo Data’s AI Excellence Award win [00:28] WeLo Data’s role as a multilingual AI human data provider [01:05] Why AI training data quality matters [01:24] How fraud can enter human data workflows [02:29] Why fraud mitigation in AI data has been underserved [02:36] The speed of AI development and the blind spot around human data integrity [04:28] How fraudulent or misrepresented data can affect model performance [04:57] Why data integrity issues can be hard to trace after model degradation [06:08] Why fraud is difficult to detect in global AI data pipelines [07:02] Which AI systems are most exposed to training data integrity risks [08:10] Identity validation and why AI data fraud differs from traditional fraud [08:35] Borrowing KYC and transaction monitoring ideas from financial services [10:27] How WeLo Data validates that NEMO is catching the right activity [11:24] Behavioral variables, rules based detection, and AI driven monitoring [13:04] The role of organizational psychology in fraud detection [13:53] Stopping threats before they reach the model [14:28] What surprised WeLo Data about the AI fraud landscape [15:30] Why multilingual and cultural context make fraud detection harder [17:02] Why continuous monitoring beats one time screening [18:04] What translated from financial services and what had to be reinvented [19:20] AI regulation, data integrity, and governance requirements [19:48] Why contributor verification may become a formal AI requirement [20:50] Why data provenance should be part of responsible AI infrastructure [21:23] Questions AI companies should ask about who produced their data [22:43] Which parts of AI infrastructure are most vulnerable [23:04] Advice for AI founders, operators, and leaders [23:53] Final thoughts on fraud, trust, and protecting AI training data
73 episodios
Comentarios
0Sé la primera persona en comentar
¡Regístrate ahora y forma parte de la comunidad de Winners' Circle!