{"id":20348,"date":"2024-03-06T05:03:22","date_gmt":"2024-03-06T05:03:22","guid":{"rendered":"https:\/\/interface.media\/?p=20348"},"modified":"2024-03-06T11:13:47","modified_gmt":"2024-03-06T11:13:47","slug":"why-is-multimodal-ai-such-a-big-deal","status":"publish","type":"post","link":"https:\/\/interface.media\/blog\/2024\/03\/06\/why-is-multimodal-ai-such-a-big-deal\/","title":{"rendered":"Why is multimodal AI such a big deal?\u00a0\u00a0"},"content":{"rendered":"\n<p>Generative artificial intelligence (AI) has arrived. However, if 2022 was the year that generative AI exploded into the public consciousness, 2023 was the year the money started rolling in. Now, 2024 is the year when investors start to scrutinise their returns. PitchBook estimates that generative AI startups raised about <a href=\"https:\/\/siliconangle.com\/2023\/12\/27\/pitchbook-tech-giants-invested-generative-ai-startups-vcs-year\/#:~:text=The%20market%20intelligence%20company's%20findings,billion%20from%20investors%20in%202023.\">$27 billion<\/a> from investors last year. OpenAI alone was projected to rake in as much as $1 billion in revenue in 2024, <a href=\"https:\/\/www.reuters.com\/technology\/generative-ais-wild-2023-2024-01-03\/\">according to Reuters<\/a>.<\/p>\n\n\n\n<p>This year, then, is the year that AI takes all-important steps towards maturity. If generative AI is to deliver on its promises, it needs to develop new capabilities and find real-world applications.<\/p>\n\n\n\n<p>Currently, it looks like multimodal AI is going to be the next true step-change in what the technology can deliver. If investor are right, multimodal AI will deliver the kind of universal input to universal output functionality that would make Generative AI commercially viable. <\/p>\n\n\n\n<h3 class=\"wp-block-heading\" id=\"h-what-is-multimodal-ai-nbsp\">What is multimodal AI?&nbsp;<\/h3>\n\n\n\n<p>A multimodal AI model is a form of machine learning that can process information from different \u201cmodalities\u201d. This includes images, videos, and text. They can then, theoretically, spit out results in a variety of formats as well.&nbsp;<\/p>\n\n\n\n<p>For example, an AI with a multimodal machine meaning model at its core could be fed a picture of a cake and generate a written recipe as a response and vice versa.<\/p>\n\n\n\n<h3 class=\"wp-block-heading\" id=\"h-why-is-multimodal-ai-a-big-deal-nbsp\">Why is multimodal AI a big deal?&nbsp;<\/h3>\n\n\n\n<p>Multimodal models represent the next big step forward in how developers enhance AI for future applications.&nbsp;<\/p>\n\n\n\n<p>For instance, according to Google, its Gemini AI can understand and generate high-quality code in popular languages like Python, Java, C++, and Go, freeing up developers to create more feature-rich apps. This code could be generated in response to anything from simple images to a voice note.&nbsp;<\/p>\n\n\n\n<p><a href=\"https:\/\/cloud.google.com\/use-cases\/multimodal-ai\">According to Google<\/a>, this brings us closer to AI that acts less like software and more like an expert assistant.<\/p>\n\n\n\n<p>\u201cMultimodality has the power to create more human-like experiences that can better take advantage of the range of senses we use as humans, such as sight, speech and hearing,\u201d <a href=\"https:\/\/news.microsoft.com\/three-big-ai-trends-to-watch-in-2024\/\">says Jennifer Marsman<\/a>, principal engineer for Microsoft\u2019s Office of the Chief Technology Officer, Kevin Scott.<\/p>\n","protected":false},"excerpt":{"rendered":"<p>Able to understand multiple types of input, multi-modal models represent the next big step in generative AI refinement. <\/p>\n","protected":false},"author":480,"featured_media":20349,"comment_status":"closed","ping_status":"open","sticky":false,"template":"","format":"standard","meta":{"_acf_changed":false,"apple_news_api_created_at":"2024-03-06T05:03:26Z","apple_news_api_id":"10caaf22-d93b-4c2e-ba4d-d0ba8a9f7165","apple_news_api_modified_at":"2024-03-06T11:13:45Z","apple_news_api_revision":"AAAAAAAAAAAAAAAAAAAAAQ==","apple_news_api_share_url":"https:\/\/apple.news\/AEMqvItk7TC66TdC6ip9xZQ","apple_news_cover_media_provider":"image","apple_news_coverimage":0,"apple_news_coverimage_caption":"","apple_news_cover_video_id":0,"apple_news_cover_video_url":"","apple_news_cover_embedwebvideo_url":"","apple_news_is_hidden":"","apple_news_is_paid":"","apple_news_is_preview":"","apple_news_is_sponsored":"","apple_news_maturity_rating":"","apple_news_metadata":"\"\"","apple_news_pullquote":"","apple_news_pullquote_position":"","apple_news_slug":"","apple_news_sections":[],"apple_news_suppress_video_url":false,"apple_news_use_image_component":false,"footnotes":""},"categories":[3],"tags":[],"topic":[614],"class_list":["post-20348","post","type-post","status-publish","format-standard","has-post-thumbnail","hentry","category-the-interface","topic-data-ai"],"acf":[],"apple_news_notices":[],"yoast_head":"<!-- This site is optimized with the Yoast SEO Premium plugin v26.6 (Yoast SEO v26.6) - https:\/\/yoast.com\/wordpress\/plugins\/seo\/ -->\n<title>Why is multimodal AI such a big deal?\u00a0\u00a0 - Interface<\/title>\n<meta name=\"description\" content=\"Able to understand multiple types of input, multi-modal models represent the next big step in generative AI refinement.\" \/>\n<meta name=\"robots\" content=\"index, follow, max-snippet:-1, max-image-preview:large, max-video-preview:-1\" \/>\n<meta property=\"og:locale\" content=\"en_GB\" \/>\n<meta property=\"og:type\" content=\"article\" \/>\n<meta property=\"og:title\" content=\"Why is multimodal AI such a big deal?\u00a0\u00a0\" \/>\n<meta property=\"og:description\" content=\"Able to understand multiple types of input, multi-modal models represent the next big step in generative AI refinement.\" \/>\n<meta property=\"og:url\" content=\"https:\/\/interface.media\/blog\/2024\/03\/06\/why-is-multimodal-ai-such-a-big-deal\/\" \/>\n<meta property=\"og:site_name\" content=\"Interface\" \/>\n<meta property=\"article:published_time\" content=\"2024-03-06T05:03:22+00:00\" \/>\n<meta property=\"article:modified_time\" content=\"2024-03-06T11:13:47+00:00\" \/>\n<meta property=\"og:image\" content=\"https:\/\/interface.media\/wp-content\/uploads\/sites\/3\/2024\/03\/iStock-1401567171.jpg\" \/>\n\t<meta property=\"og:image:width\" content=\"1374\" \/>\n\t<meta property=\"og:image:height\" content=\"763\" \/>\n\t<meta property=\"og:image:type\" content=\"image\/jpeg\" \/>\n<meta name=\"author\" content=\"Dan Brightmore\" \/>\n<meta name=\"twitter:card\" content=\"summary_large_image\" \/>\n<meta name=\"twitter:label1\" content=\"Written by\" \/>\n\t<meta name=\"twitter:data1\" content=\"Dan Brightmore\" \/>\n\t<meta name=\"twitter:label2\" content=\"Estimated reading time\" \/>\n\t<meta name=\"twitter:data2\" content=\"2 minutes\" \/>\n<script type=\"application\/ld+json\" class=\"yoast-schema-graph\">{\"@context\":\"https:\/\/schema.org\",\"@graph\":[{\"@type\":\"WebPage\",\"@id\":\"https:\/\/interface.media\/blog\/2024\/03\/06\/why-is-multimodal-ai-such-a-big-deal\/\",\"url\":\"https:\/\/interface.media\/blog\/2024\/03\/06\/why-is-multimodal-ai-such-a-big-deal\/\",\"name\":\"Why is multimodal AI such a big deal?\u00a0\u00a0 - Interface\",\"isPartOf\":{\"@id\":\"https:\/\/interface.media\/#website\"},\"primaryImageOfPage\":{\"@id\":\"https:\/\/interface.media\/blog\/2024\/03\/06\/why-is-multimodal-ai-such-a-big-deal\/#primaryimage\"},\"image\":{\"@id\":\"https:\/\/interface.media\/blog\/2024\/03\/06\/why-is-multimodal-ai-such-a-big-deal\/#primaryimage\"},\"thumbnailUrl\":\"https:\/\/interface.media\/wp-content\/uploads\/sites\/3\/2024\/03\/iStock-1401567171.jpg\",\"datePublished\":\"2024-03-06T05:03:22+00:00\",\"dateModified\":\"2024-03-06T11:13:47+00:00\",\"author\":{\"@id\":\"https:\/\/interface.media\/#\/schema\/person\/7c33499ca8e42b097028109cccb22748\"},\"description\":\"Able to understand multiple types of input, multi-modal models represent the next big step in generative AI refinement.\",\"breadcrumb\":{\"@id\":\"https:\/\/interface.media\/blog\/2024\/03\/06\/why-is-multimodal-ai-such-a-big-deal\/#breadcrumb\"},\"inLanguage\":\"en-GB\",\"potentialAction\":[{\"@type\":\"ReadAction\",\"target\":[\"https:\/\/interface.media\/blog\/2024\/03\/06\/why-is-multimodal-ai-such-a-big-deal\/\"]}]},{\"@type\":\"ImageObject\",\"inLanguage\":\"en-GB\",\"@id\":\"https:\/\/interface.media\/blog\/2024\/03\/06\/why-is-multimodal-ai-such-a-big-deal\/#primaryimage\",\"url\":\"https:\/\/interface.media\/wp-content\/uploads\/sites\/3\/2024\/03\/iStock-1401567171.jpg\",\"contentUrl\":\"https:\/\/interface.media\/wp-content\/uploads\/sites\/3\/2024\/03\/iStock-1401567171.jpg\",\"width\":1374,\"height\":763,\"caption\":\"Cover design template. Side view of dotted face background. Geometric pattern. 3D vector illustration for brochure, poster, card, invitation, poster, textile print, presentation, flyer or banner.\"},{\"@type\":\"BreadcrumbList\",\"@id\":\"https:\/\/interface.media\/blog\/2024\/03\/06\/why-is-multimodal-ai-such-a-big-deal\/#breadcrumb\",\"itemListElement\":[{\"@type\":\"ListItem\",\"position\":1,\"name\":\"Home\",\"item\":\"https:\/\/interface.media\/\"},{\"@type\":\"ListItem\",\"position\":2,\"name\":\"Why is multimodal AI such a big deal?\u00a0\u00a0\"}]},{\"@type\":\"WebSite\",\"@id\":\"https:\/\/interface.media\/#website\",\"url\":\"https:\/\/interface.media\/\",\"name\":\"Interface\",\"description\":\"Delivering World Class Content \u201cFrom Executive, For Executive\u201c\",\"potentialAction\":[{\"@type\":\"SearchAction\",\"target\":{\"@type\":\"EntryPoint\",\"urlTemplate\":\"https:\/\/interface.media\/?s={search_term_string}\"},\"query-input\":{\"@type\":\"PropertyValueSpecification\",\"valueRequired\":true,\"valueName\":\"search_term_string\"}}],\"inLanguage\":\"en-GB\"},{\"@type\":\"Person\",\"@id\":\"https:\/\/interface.media\/#\/schema\/person\/7c33499ca8e42b097028109cccb22748\",\"name\":\"Dan Brightmore\",\"image\":{\"@type\":\"ImageObject\",\"inLanguage\":\"en-GB\",\"@id\":\"https:\/\/interface.media\/#\/schema\/person\/image\/\",\"url\":\"https:\/\/secure.gravatar.com\/avatar\/e9ca282f0ef431735a64685769ad57886e24b074c4c58314392755fb79164164?s=96&d=mm&r=g\",\"contentUrl\":\"https:\/\/secure.gravatar.com\/avatar\/e9ca282f0ef431735a64685769ad57886e24b074c4c58314392755fb79164164?s=96&d=mm&r=g\",\"caption\":\"Dan Brightmore\"},\"url\":\"https:\/\/interface.media\/blog\/author\/dbrightmore\/\"}]}<\/script>\n<!-- \/ Yoast SEO Premium plugin. -->","yoast_head_json":{"title":"Why is multimodal AI such a big deal?\u00a0\u00a0 - Interface","description":"Able to understand multiple types of input, multi-modal models represent the next big step in generative AI refinement.","robots":{"index":"index","follow":"follow","max-snippet":"max-snippet:-1","max-image-preview":"max-image-preview:large","max-video-preview":"max-video-preview:-1"},"og_locale":"en_GB","og_type":"article","og_title":"Why is multimodal AI such a big deal?\u00a0\u00a0","og_description":"Able to understand multiple types of input, multi-modal models represent the next big step in generative AI refinement.","og_url":"https:\/\/interface.media\/blog\/2024\/03\/06\/why-is-multimodal-ai-such-a-big-deal\/","og_site_name":"Interface","article_published_time":"2024-03-06T05:03:22+00:00","article_modified_time":"2024-03-06T11:13:47+00:00","og_image":[{"width":1374,"height":763,"url":"https:\/\/interface.media\/wp-content\/uploads\/sites\/3\/2024\/03\/iStock-1401567171.jpg","type":"image\/jpeg"}],"author":"Dan Brightmore","twitter_card":"summary_large_image","twitter_misc":{"Written by":"Dan Brightmore","Estimated reading time":"2 minutes"},"schema":{"@context":"https:\/\/schema.org","@graph":[{"@type":"WebPage","@id":"https:\/\/interface.media\/blog\/2024\/03\/06\/why-is-multimodal-ai-such-a-big-deal\/","url":"https:\/\/interface.media\/blog\/2024\/03\/06\/why-is-multimodal-ai-such-a-big-deal\/","name":"Why is multimodal AI such a big deal?\u00a0\u00a0 - Interface","isPartOf":{"@id":"https:\/\/interface.media\/#website"},"primaryImageOfPage":{"@id":"https:\/\/interface.media\/blog\/2024\/03\/06\/why-is-multimodal-ai-such-a-big-deal\/#primaryimage"},"image":{"@id":"https:\/\/interface.media\/blog\/2024\/03\/06\/why-is-multimodal-ai-such-a-big-deal\/#primaryimage"},"thumbnailUrl":"https:\/\/interface.media\/wp-content\/uploads\/sites\/3\/2024\/03\/iStock-1401567171.jpg","datePublished":"2024-03-06T05:03:22+00:00","dateModified":"2024-03-06T11:13:47+00:00","author":{"@id":"https:\/\/interface.media\/#\/schema\/person\/7c33499ca8e42b097028109cccb22748"},"description":"Able to understand multiple types of input, multi-modal models represent the next big step in generative AI refinement.","breadcrumb":{"@id":"https:\/\/interface.media\/blog\/2024\/03\/06\/why-is-multimodal-ai-such-a-big-deal\/#breadcrumb"},"inLanguage":"en-GB","potentialAction":[{"@type":"ReadAction","target":["https:\/\/interface.media\/blog\/2024\/03\/06\/why-is-multimodal-ai-such-a-big-deal\/"]}]},{"@type":"ImageObject","inLanguage":"en-GB","@id":"https:\/\/interface.media\/blog\/2024\/03\/06\/why-is-multimodal-ai-such-a-big-deal\/#primaryimage","url":"https:\/\/interface.media\/wp-content\/uploads\/sites\/3\/2024\/03\/iStock-1401567171.jpg","contentUrl":"https:\/\/interface.media\/wp-content\/uploads\/sites\/3\/2024\/03\/iStock-1401567171.jpg","width":1374,"height":763,"caption":"Cover design template. Side view of dotted face background. Geometric pattern. 3D vector illustration for brochure, poster, card, invitation, poster, textile print, presentation, flyer or banner."},{"@type":"BreadcrumbList","@id":"https:\/\/interface.media\/blog\/2024\/03\/06\/why-is-multimodal-ai-such-a-big-deal\/#breadcrumb","itemListElement":[{"@type":"ListItem","position":1,"name":"Home","item":"https:\/\/interface.media\/"},{"@type":"ListItem","position":2,"name":"Why is multimodal AI such a big deal?\u00a0\u00a0"}]},{"@type":"WebSite","@id":"https:\/\/interface.media\/#website","url":"https:\/\/interface.media\/","name":"Interface","description":"Delivering World Class Content \u201cFrom Executive, For Executive\u201c","potentialAction":[{"@type":"SearchAction","target":{"@type":"EntryPoint","urlTemplate":"https:\/\/interface.media\/?s={search_term_string}"},"query-input":{"@type":"PropertyValueSpecification","valueRequired":true,"valueName":"search_term_string"}}],"inLanguage":"en-GB"},{"@type":"Person","@id":"https:\/\/interface.media\/#\/schema\/person\/7c33499ca8e42b097028109cccb22748","name":"Dan Brightmore","image":{"@type":"ImageObject","inLanguage":"en-GB","@id":"https:\/\/interface.media\/#\/schema\/person\/image\/","url":"https:\/\/secure.gravatar.com\/avatar\/e9ca282f0ef431735a64685769ad57886e24b074c4c58314392755fb79164164?s=96&d=mm&r=g","contentUrl":"https:\/\/secure.gravatar.com\/avatar\/e9ca282f0ef431735a64685769ad57886e24b074c4c58314392755fb79164164?s=96&d=mm&r=g","caption":"Dan Brightmore"},"url":"https:\/\/interface.media\/blog\/author\/dbrightmore\/"}]}},"_links":{"self":[{"href":"https:\/\/interface.media\/wp-json\/wp\/v2\/posts\/20348","targetHints":{"allow":["GET"]}}],"collection":[{"href":"https:\/\/interface.media\/wp-json\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/interface.media\/wp-json\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"https:\/\/interface.media\/wp-json\/wp\/v2\/users\/480"}],"replies":[{"embeddable":true,"href":"https:\/\/interface.media\/wp-json\/wp\/v2\/comments?post=20348"}],"version-history":[{"count":3,"href":"https:\/\/interface.media\/wp-json\/wp\/v2\/posts\/20348\/revisions"}],"predecessor-version":[{"id":20373,"href":"https:\/\/interface.media\/wp-json\/wp\/v2\/posts\/20348\/revisions\/20373"}],"wp:featuredmedia":[{"embeddable":true,"href":"https:\/\/interface.media\/wp-json\/wp\/v2\/media\/20349"}],"wp:attachment":[{"href":"https:\/\/interface.media\/wp-json\/wp\/v2\/media?parent=20348"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/interface.media\/wp-json\/wp\/v2\/categories?post=20348"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/interface.media\/wp-json\/wp\/v2\/tags?post=20348"},{"taxonomy":"topic","embeddable":true,"href":"https:\/\/interface.media\/wp-json\/wp\/v2\/topic?post=20348"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}