{"id":21057,"date":"2024-06-19T15:01:32","date_gmt":"2024-06-19T15:01:32","guid":{"rendered":"https:\/\/interface.media\/?p=21057"},"modified":"2024-06-19T15:01:39","modified_gmt":"2024-06-19T15:01:39","slug":"better-not-bigger-the-ai-data-quality-crisis","status":"publish","type":"post","link":"https:\/\/interface.media\/blog\/2024\/06\/19\/better-not-bigger-the-ai-data-quality-crisis\/","title":{"rendered":"Better, not bigger: the AI data quality crisis\u00a0\u00a0"},"content":{"rendered":"\n<p>It\u2019s neither new nor controversial to say that the world runs on data. Big data analytics are fundamental to maintaining agility and visibility. This is not to mention unlocking valuable insights that let orangisations stay competitive. Globally, the big data market is expected to grow to more than <a href=\"https:\/\/edgedelta.com\/company\/blog\/what-percentage-of-company-invest-in-big-data#:~:text=The%20global%20big%20data%20market,corporate%20investment%20and%20innovation%20strategies.\">$401 billion by the end of 2028<\/a>\u2014up from $220 billion last year.\u00a0<\/p>\n\n\n\n<p>Business leaders can pretty much universally agree that data is undeniably important. However, actually leveraging that data into impactful business outcomes remains a huge challenge for a lot of companies. Increasingly, focusing on the volume and variety of data alone leaves organisations without the one thing they really need: data they can trust.\u00a0<\/p>\n\n\n\n<h3 class=\"wp-block-heading\" id=\"h-data-quality-not-just-quantity-nbsp\">Data quality, not just quantity&nbsp;<\/h3>\n\n\n\n<p>No matter how sophisticated the analytical tool, the quality of data that goes in determines the quality of insight that comes out. Good quality data is data that is suitable for its intended use. Poor quality data fails to meet this criterion. In other words, poor quality data cannot effectively support the outcomes it is being used to generate.<\/p>\n\n\n\n<p>Raw data often falls into the category of poor quality data. For instance, data collected from social media platforms like Twitter is unstructured. In this raw form, it isn\u2019t particularly useful for analysis or other valuable applications. Nonetheless, raw data can be transformed into good quality data through data cleaning and processing, which typically requires time.<\/p>\n\n\n\n<p>Some bad data, however, is simply inaccurate, misleading, or fundamentally flawed. It can\u2019t be easily refined into anything useful, and its presence in a data set can spoil any results. Data that lacks structure or has issues such as inaccuracy, incompleteness, inconsistencies, and duplication is considered poor quality data.<\/p>\n\n\n\n<h3 class=\"wp-block-heading\" id=\"h-is-ai-solving-the-problem-or-creating-it-nbsp\">Is AI solving the problem or creating it?&nbsp;<\/h3>\n\n\n\n<p>Concerns over data quality are as old at spreadsheets and maybe even the abacus. Managing, structuring, and creating insights from data only gets more complicated the more data you gather, and organisations today gather a frighteningly large amount of data as a matter of course.They might not be able to do anything with it, but everyone knows that data is valuable, so organisations take a more is more approach and hoover up as much as they can.&nbsp;&nbsp;<\/p>\n\n\n\n<p>New tools like generative artificial intelligence (AI) promise to help companies capture the value present in their data. The technology exploded onto the scene, promising rapid and sophisticated data analysis. Now, questionable inputs are being blamed for the hallucinations and other odd behaviours that very publicly undermined LLMs&#8217; effectiveness. The current debacle with Google\u2019s AI-assisted search being <a href=\"https:\/\/interface.media\/blog\/2024\/04\/18\/the-next-generation-of-generative-ai-will-be-trained-on-reddit-threads-and-tumblr-posts\/\">trained on reddit posts<\/a> is a perfect example.\u00a0<\/p>\n\n\n\n<p>However, AI has also been criticised for muddying the waters and further degrading the quality of data available.&nbsp;<\/p>\n\n\n\n<p>\u201cHow can we trust all our data in <a href=\"https:\/\/interface.media\/blog\/2024\/06\/13\/is-the-ai-bubble-set-to-burst\/\">the generative AI economy<\/a>?\u201d asks Tuna Yemisci, regional director of Middle East, Africa and East Med at Qlik in <a href=\"https:\/\/it-online.co.za\/2024\/05\/28\/from-big-data-to-better-data\/\">a recent article<\/a>. The trend isn\u2019t going away either, with reports coming out earlier this year that observe <a href=\"https:\/\/www.datanami.com\/2024\/04\/05\/data-quality-getting-worse-report-says\/\">data quality getting worse<\/a>. A survey by dbt Labs found in April that poor data quality was the number one concern of the 456 analytics engineers, data engineers, data analysts, and other data professionals who took the survey.<\/p>\n\n\n\n<h3 class=\"wp-block-heading\" id=\"h-the-feedback-loop-nbsp\">The feedback loop&nbsp;<\/h3>\n\n\n\n<p>Not only is AI undermining the quality of existing data, but bad existing data is undermining attempts to find applications for generative AI. The whole issue is in danger of creating a feedback loop that undermines the tech industry\u2019s biggest bets for the future of digital economic activity.&nbsp;<\/p>\n\n\n\n<p>\u201cThere\u2019s a common assumption that the data (companies) have accumulated over the years is AI-ready, but that\u2019s not the case,\u201d Joseph Ours, a Partner at Centric Consulting wrote in a recent blog post. \u201cThe reality is that no one has truly AI-ready data, at least not yet\u2026 Rushing into AI projects with incomplete data can be a recipe for disappointment. The power of AI lies in its ability to find patterns and insights humans might overlook. But if the necessary data is unavailable, even the most sophisticated AI cannot generate the insights organisations want most.\u201d<\/p>\n","protected":false},"excerpt":{"rendered":"<p>No one doubts the value of data, but inaccurate, low quality, poorly organised data is a growing problem for organisations across multiple industries. <\/p>\n","protected":false},"author":480,"featured_media":21058,"comment_status":"closed","ping_status":"open","sticky":false,"template":"","format":"standard","meta":{"_acf_changed":false,"apple_news_api_created_at":"2024-06-19T15:01:36Z","apple_news_api_id":"f7da3b8e-8a8f-4d8a-8744-86366fb79b79","apple_news_api_modified_at":"2024-06-19T15:01:36Z","apple_news_api_revision":"AAAAAAAAAAD\/\/\/\/\/\/\/\/\/\/w==","apple_news_api_share_url":"https:\/\/apple.news\/A99o7joqPTYqHRIY2b7ebeQ","apple_news_cover_media_provider":"image","apple_news_coverimage":0,"apple_news_coverimage_caption":"","apple_news_cover_video_id":0,"apple_news_cover_video_url":"","apple_news_cover_embedwebvideo_url":"","apple_news_is_hidden":"","apple_news_is_paid":"","apple_news_is_preview":"","apple_news_is_sponsored":"","apple_news_maturity_rating":"","apple_news_metadata":"\"\"","apple_news_pullquote":"","apple_news_pullquote_position":"","apple_news_slug":"","apple_news_sections":[],"apple_news_suppress_video_url":false,"apple_news_use_image_component":false,"footnotes":""},"categories":[3],"tags":[],"topic":[614],"class_list":["post-21057","post","type-post","status-publish","format-standard","has-post-thumbnail","hentry","category-the-interface","topic-data-ai"],"acf":[],"apple_news_notices":[],"yoast_head":"<!-- This site is optimized with the Yoast SEO Premium plugin v26.6 (Yoast SEO v26.6) - https:\/\/yoast.com\/wordpress\/plugins\/seo\/ -->\n<title>Better, not bigger: the AI data quality crisis\u00a0\u00a0 - Interface<\/title>\n<meta name=\"description\" content=\"No one doubts the value of data, but inaccurate, low quality, poorly organised data is a growing problem for organisations.\" \/>\n<meta name=\"robots\" content=\"index, follow, max-snippet:-1, max-image-preview:large, max-video-preview:-1\" \/>\n<meta property=\"og:locale\" content=\"en_GB\" \/>\n<meta property=\"og:type\" content=\"article\" \/>\n<meta property=\"og:title\" content=\"Better, not bigger: the AI data quality crisis\u00a0\u00a0\" \/>\n<meta property=\"og:description\" content=\"No one doubts the value of data, but inaccurate, low quality, poorly organised data is a growing problem for organisations.\" \/>\n<meta property=\"og:url\" content=\"https:\/\/interface.media\/blog\/2024\/06\/19\/better-not-bigger-the-ai-data-quality-crisis\/\" \/>\n<meta property=\"og:site_name\" content=\"Interface\" \/>\n<meta property=\"article:published_time\" content=\"2024-06-19T15:01:32+00:00\" \/>\n<meta property=\"article:modified_time\" content=\"2024-06-19T15:01:39+00:00\" \/>\n<meta property=\"og:image\" content=\"https:\/\/interface.media\/wp-content\/uploads\/sites\/3\/2024\/06\/iStock-1694201673-scaled.jpg\" \/>\n\t<meta property=\"og:image:width\" content=\"2560\" \/>\n\t<meta property=\"og:image:height\" content=\"1441\" \/>\n\t<meta property=\"og:image:type\" content=\"image\/jpeg\" \/>\n<meta name=\"author\" content=\"Dan Brightmore\" \/>\n<meta name=\"twitter:card\" content=\"summary_large_image\" \/>\n<meta name=\"twitter:label1\" content=\"Written by\" \/>\n\t<meta name=\"twitter:data1\" content=\"Dan Brightmore\" \/>\n\t<meta name=\"twitter:label2\" content=\"Estimated reading time\" \/>\n\t<meta name=\"twitter:data2\" content=\"4 minutes\" \/>\n<script type=\"application\/ld+json\" class=\"yoast-schema-graph\">{\"@context\":\"https:\/\/schema.org\",\"@graph\":[{\"@type\":\"WebPage\",\"@id\":\"https:\/\/interface.media\/blog\/2024\/06\/19\/better-not-bigger-the-ai-data-quality-crisis\/\",\"url\":\"https:\/\/interface.media\/blog\/2024\/06\/19\/better-not-bigger-the-ai-data-quality-crisis\/\",\"name\":\"Better, not bigger: the AI data quality crisis\u00a0\u00a0 - Interface\",\"isPartOf\":{\"@id\":\"https:\/\/interface.media\/#website\"},\"primaryImageOfPage\":{\"@id\":\"https:\/\/interface.media\/blog\/2024\/06\/19\/better-not-bigger-the-ai-data-quality-crisis\/#primaryimage\"},\"image\":{\"@id\":\"https:\/\/interface.media\/blog\/2024\/06\/19\/better-not-bigger-the-ai-data-quality-crisis\/#primaryimage\"},\"thumbnailUrl\":\"https:\/\/interface.media\/wp-content\/uploads\/sites\/3\/2024\/06\/iStock-1694201673-scaled.jpg\",\"datePublished\":\"2024-06-19T15:01:32+00:00\",\"dateModified\":\"2024-06-19T15:01:39+00:00\",\"author\":{\"@id\":\"https:\/\/interface.media\/#\/schema\/person\/7c33499ca8e42b097028109cccb22748\"},\"description\":\"No one doubts the value of data, but inaccurate, low quality, poorly organised data is a growing problem for organisations.\",\"breadcrumb\":{\"@id\":\"https:\/\/interface.media\/blog\/2024\/06\/19\/better-not-bigger-the-ai-data-quality-crisis\/#breadcrumb\"},\"inLanguage\":\"en-GB\",\"potentialAction\":[{\"@type\":\"ReadAction\",\"target\":[\"https:\/\/interface.media\/blog\/2024\/06\/19\/better-not-bigger-the-ai-data-quality-crisis\/\"]}]},{\"@type\":\"ImageObject\",\"inLanguage\":\"en-GB\",\"@id\":\"https:\/\/interface.media\/blog\/2024\/06\/19\/better-not-bigger-the-ai-data-quality-crisis\/#primaryimage\",\"url\":\"https:\/\/interface.media\/wp-content\/uploads\/sites\/3\/2024\/06\/iStock-1694201673-scaled.jpg\",\"contentUrl\":\"https:\/\/interface.media\/wp-content\/uploads\/sites\/3\/2024\/06\/iStock-1694201673-scaled.jpg\",\"width\":2560,\"height\":1441,\"caption\":\"No one doubts the value of data, but inaccurate, low quality, poorly organised data is a growing problem for organisations across multiple industries.\"},{\"@type\":\"BreadcrumbList\",\"@id\":\"https:\/\/interface.media\/blog\/2024\/06\/19\/better-not-bigger-the-ai-data-quality-crisis\/#breadcrumb\",\"itemListElement\":[{\"@type\":\"ListItem\",\"position\":1,\"name\":\"Home\",\"item\":\"https:\/\/interface.media\/\"},{\"@type\":\"ListItem\",\"position\":2,\"name\":\"Better, not bigger: the AI data quality crisis\u00a0\u00a0\"}]},{\"@type\":\"WebSite\",\"@id\":\"https:\/\/interface.media\/#website\",\"url\":\"https:\/\/interface.media\/\",\"name\":\"Interface\",\"description\":\"Delivering World Class Content \u201cFrom Executive, For Executive\u201c\",\"potentialAction\":[{\"@type\":\"SearchAction\",\"target\":{\"@type\":\"EntryPoint\",\"urlTemplate\":\"https:\/\/interface.media\/?s={search_term_string}\"},\"query-input\":{\"@type\":\"PropertyValueSpecification\",\"valueRequired\":true,\"valueName\":\"search_term_string\"}}],\"inLanguage\":\"en-GB\"},{\"@type\":\"Person\",\"@id\":\"https:\/\/interface.media\/#\/schema\/person\/7c33499ca8e42b097028109cccb22748\",\"name\":\"Dan Brightmore\",\"image\":{\"@type\":\"ImageObject\",\"inLanguage\":\"en-GB\",\"@id\":\"https:\/\/interface.media\/#\/schema\/person\/image\/\",\"url\":\"https:\/\/secure.gravatar.com\/avatar\/e9ca282f0ef431735a64685769ad57886e24b074c4c58314392755fb79164164?s=96&d=mm&r=g\",\"contentUrl\":\"https:\/\/secure.gravatar.com\/avatar\/e9ca282f0ef431735a64685769ad57886e24b074c4c58314392755fb79164164?s=96&d=mm&r=g\",\"caption\":\"Dan Brightmore\"},\"url\":\"https:\/\/interface.media\/blog\/author\/dbrightmore\/\"}]}<\/script>\n<!-- \/ Yoast SEO Premium plugin. -->","yoast_head_json":{"title":"Better, not bigger: the AI data quality crisis\u00a0\u00a0 - Interface","description":"No one doubts the value of data, but inaccurate, low quality, poorly organised data is a growing problem for organisations.","robots":{"index":"index","follow":"follow","max-snippet":"max-snippet:-1","max-image-preview":"max-image-preview:large","max-video-preview":"max-video-preview:-1"},"og_locale":"en_GB","og_type":"article","og_title":"Better, not bigger: the AI data quality crisis\u00a0\u00a0","og_description":"No one doubts the value of data, but inaccurate, low quality, poorly organised data is a growing problem for organisations.","og_url":"https:\/\/interface.media\/blog\/2024\/06\/19\/better-not-bigger-the-ai-data-quality-crisis\/","og_site_name":"Interface","article_published_time":"2024-06-19T15:01:32+00:00","article_modified_time":"2024-06-19T15:01:39+00:00","og_image":[{"width":2560,"height":1441,"url":"https:\/\/interface.media\/wp-content\/uploads\/sites\/3\/2024\/06\/iStock-1694201673-scaled.jpg","type":"image\/jpeg"}],"author":"Dan Brightmore","twitter_card":"summary_large_image","twitter_misc":{"Written by":"Dan Brightmore","Estimated reading time":"4 minutes"},"schema":{"@context":"https:\/\/schema.org","@graph":[{"@type":"WebPage","@id":"https:\/\/interface.media\/blog\/2024\/06\/19\/better-not-bigger-the-ai-data-quality-crisis\/","url":"https:\/\/interface.media\/blog\/2024\/06\/19\/better-not-bigger-the-ai-data-quality-crisis\/","name":"Better, not bigger: the AI data quality crisis\u00a0\u00a0 - Interface","isPartOf":{"@id":"https:\/\/interface.media\/#website"},"primaryImageOfPage":{"@id":"https:\/\/interface.media\/blog\/2024\/06\/19\/better-not-bigger-the-ai-data-quality-crisis\/#primaryimage"},"image":{"@id":"https:\/\/interface.media\/blog\/2024\/06\/19\/better-not-bigger-the-ai-data-quality-crisis\/#primaryimage"},"thumbnailUrl":"https:\/\/interface.media\/wp-content\/uploads\/sites\/3\/2024\/06\/iStock-1694201673-scaled.jpg","datePublished":"2024-06-19T15:01:32+00:00","dateModified":"2024-06-19T15:01:39+00:00","author":{"@id":"https:\/\/interface.media\/#\/schema\/person\/7c33499ca8e42b097028109cccb22748"},"description":"No one doubts the value of data, but inaccurate, low quality, poorly organised data is a growing problem for organisations.","breadcrumb":{"@id":"https:\/\/interface.media\/blog\/2024\/06\/19\/better-not-bigger-the-ai-data-quality-crisis\/#breadcrumb"},"inLanguage":"en-GB","potentialAction":[{"@type":"ReadAction","target":["https:\/\/interface.media\/blog\/2024\/06\/19\/better-not-bigger-the-ai-data-quality-crisis\/"]}]},{"@type":"ImageObject","inLanguage":"en-GB","@id":"https:\/\/interface.media\/blog\/2024\/06\/19\/better-not-bigger-the-ai-data-quality-crisis\/#primaryimage","url":"https:\/\/interface.media\/wp-content\/uploads\/sites\/3\/2024\/06\/iStock-1694201673-scaled.jpg","contentUrl":"https:\/\/interface.media\/wp-content\/uploads\/sites\/3\/2024\/06\/iStock-1694201673-scaled.jpg","width":2560,"height":1441,"caption":"No one doubts the value of data, but inaccurate, low quality, poorly organised data is a growing problem for organisations across multiple industries."},{"@type":"BreadcrumbList","@id":"https:\/\/interface.media\/blog\/2024\/06\/19\/better-not-bigger-the-ai-data-quality-crisis\/#breadcrumb","itemListElement":[{"@type":"ListItem","position":1,"name":"Home","item":"https:\/\/interface.media\/"},{"@type":"ListItem","position":2,"name":"Better, not bigger: the AI data quality crisis\u00a0\u00a0"}]},{"@type":"WebSite","@id":"https:\/\/interface.media\/#website","url":"https:\/\/interface.media\/","name":"Interface","description":"Delivering World Class Content \u201cFrom Executive, For Executive\u201c","potentialAction":[{"@type":"SearchAction","target":{"@type":"EntryPoint","urlTemplate":"https:\/\/interface.media\/?s={search_term_string}"},"query-input":{"@type":"PropertyValueSpecification","valueRequired":true,"valueName":"search_term_string"}}],"inLanguage":"en-GB"},{"@type":"Person","@id":"https:\/\/interface.media\/#\/schema\/person\/7c33499ca8e42b097028109cccb22748","name":"Dan Brightmore","image":{"@type":"ImageObject","inLanguage":"en-GB","@id":"https:\/\/interface.media\/#\/schema\/person\/image\/","url":"https:\/\/secure.gravatar.com\/avatar\/e9ca282f0ef431735a64685769ad57886e24b074c4c58314392755fb79164164?s=96&d=mm&r=g","contentUrl":"https:\/\/secure.gravatar.com\/avatar\/e9ca282f0ef431735a64685769ad57886e24b074c4c58314392755fb79164164?s=96&d=mm&r=g","caption":"Dan Brightmore"},"url":"https:\/\/interface.media\/blog\/author\/dbrightmore\/"}]}},"_links":{"self":[{"href":"https:\/\/interface.media\/wp-json\/wp\/v2\/posts\/21057","targetHints":{"allow":["GET"]}}],"collection":[{"href":"https:\/\/interface.media\/wp-json\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/interface.media\/wp-json\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"https:\/\/interface.media\/wp-json\/wp\/v2\/users\/480"}],"replies":[{"embeddable":true,"href":"https:\/\/interface.media\/wp-json\/wp\/v2\/comments?post=21057"}],"version-history":[{"count":1,"href":"https:\/\/interface.media\/wp-json\/wp\/v2\/posts\/21057\/revisions"}],"predecessor-version":[{"id":21059,"href":"https:\/\/interface.media\/wp-json\/wp\/v2\/posts\/21057\/revisions\/21059"}],"wp:featuredmedia":[{"embeddable":true,"href":"https:\/\/interface.media\/wp-json\/wp\/v2\/media\/21058"}],"wp:attachment":[{"href":"https:\/\/interface.media\/wp-json\/wp\/v2\/media?parent=21057"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/interface.media\/wp-json\/wp\/v2\/categories?post=21057"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/interface.media\/wp-json\/wp\/v2\/tags?post=21057"},{"taxonomy":"topic","embeddable":true,"href":"https:\/\/interface.media\/wp-json\/wp\/v2\/topic?post=21057"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}