{"id":219742,"date":"2025-01-20T12:23:08","date_gmt":"2025-01-20T12:23:08","guid":{"rendered":"https:\/\/news.talkwithrattan.com\/index.php\/2025\/01\/20\/googles-new-titans-ai-architecture-could-give-llms-long-term-memory\/"},"modified":"2025-01-20T12:23:08","modified_gmt":"2025-01-20T12:23:08","slug":"googles-new-titans-ai-architecture-could-give-llms-long-term-memory","status":"publish","type":"post","link":"https:\/\/news.talkwithrattan.com\/index.php\/2025\/01\/20\/googles-new-titans-ai-architecture-could-give-llms-long-term-memory\/","title":{"rendered":"Google\u2019s New Titans AI Architecture Could Give LLMs &#8216;Long-Term Memory&#8217;"},"content":{"rendered":"<div style=\"text-align:center\"><img decoding=\"async\" src=\"https:\/\/i2.wp.com\/i.gadgets360cdn.com\/large\/google_logo1_reuters_1737368900674.jpg?ssl=1\" class=\"attachment-post-thumbnail size-post-thumbnail wp-post-image\" alt=\"Google\u2019s New Titans AI Architecture Could Give LLMs &#8216;Long-Term Memory&#8217;\" title=\"Google\u2019s New Titans AI Architecture Could Give LLMs &#8216;Long-Term Memory&#8217;\" \/><\/div><p> <br \/>\n<\/p>\n<div>\n<p><a href=\"https:\/\/www.gadgets360.com\/google\">Google<\/a> researchers unveiled a new artificial intelligence (AI) architecture last week that can enable large language models (LLMs) to remember the long-term context of events and topics. A paper was published by the Mountain View-based tech giant on the topic, and the researchers claim that AI models trained using this architecture displayed a more \u201chuman-like\u201d memory retention capability. Notably, Google ditched the traditional Transformer and Recurrent Neural Network (RNN) architectures to develop a new method to teach AI models how to remember contextual information.<\/p>\n<h2 id=\"google-s-titans-ai-architecture-unveiled\">Titans Can Scale AI Models&#8217; Context Window More Than 2 Million Tokens<\/h2>\n<p>The lead researcher of the project, Ali Behrouz, <a href=\"https:\/\/x.com\/behrouz_ali\/status\/1878859086227255347\" target=\"_blank\" rel=\"nofollow noopener\">posted<\/a> about the new architecture on X (formerly known as Twitter). He claimed that the new architecture provides a meta in-context memory with attention that teaches AI models how to remember the information at test-time compute.<\/p>\n<p>According to Google&#8217;s paper, which has been <a href=\"https:\/\/arxiv.org\/pdf\/2501.00663v1\" target=\"_blank\" rel=\"nofollow noopener\">published<\/a> in the pre-print online journal arXiv, the Titans architecture can scale the context window of AI models to larger than two million tokens. Memory has been a tricky problem to solve for AI developers.<\/p>\n<p>Humans remember information and events with context. If someone asked a person about what he wore last weekend, they would be able to remember additional contextual information, such as attending a birthday party of a person who they have known for the last 12 years.This way, when asked a follow-up question about why they wore a brown jacket and denim jeans last weekend, the person would be able to contextualise it with all these short-term and long-term information.<\/p>\n<p>AI models, on the other hand, typically use retrieval-augmented generation (RAG) systems, modified for Transformer and RNN architectures. It uses information as neural nodes. So, when an AI model has been asked a question, it accesses the particular node that contains the main information, as well as the nearby nodes that might contain additional or related information. However, once a query is solved, the information is removed from the system to save processing power.<\/p>\n<p>However, there are two downsides to this. First, an AI model cannot remember information in the long run. If one wanted to ask a follow-up question after a session was over, one would have to provide the full context again (unlike how humans function). Second, AI models do a poor job of retrieving information involving long-term context.<\/p>\n<p>With Titans AI, Behrouz and other Google researchers sought to build an architecture which enables AI models to develop a long-term memory that can be continually run, while forgetting information so that it be computationally optimised.<\/p>\n<p>To this end, the researchers designed an architecture that encodes history into the parameters of a neural network. Three variants were used \u2014 Memory as Context (MAC), Memory as Gating (MAG), and Memory as a Layer (MAL). Each of these variants is suited for particular tasks.<\/p>\n<p>Additionally, Titans uses a new surprise-based learning systen, which tells AI models to remember unexpected or key information about a topic. These two changes allow Titans architecture to showcase improved memory function in LLMs.<\/p>\n<div align=\"center\">\n<blockquote class=\"twitter-tweet\">\n<p dir=\"ltr\" lang=\"en\">In the BABILong benchmark, Titans (MAC) shows outstanding performance, where it effectively scales to larger than 2M context window, outperforming large models like GPT-4, Llama3 + RAG, and Llama3-70B. <a href=\"https:\/\/t.co\/ZdngmtGIoW\" rel=\"nofollow noopener\" target=\"_blank\">pic.twitter.com\/ZdngmtGIoW<\/a><\/p>\n<p>\u2014 Ali Behrouz (@behrouz_ali) <a href=\"https:\/\/twitter.com\/behrouz_ali\/status\/1878860200096182473?ref_src=twsrc%5Etfw\" rel=\"nofollow noopener\" target=\"_blank\">January 13, 2025<\/a><\/p><\/blockquote>\n<\/div>\n<p>In a separate post, Behrouz claimed that based on internal testing on the BABILong benchmark (needle-in-a-haystack approach), Titans (MAC) models were able to outperform large AI models such as GPT-4, LLama 3 + RAG, and LLama 3 70B.<\/p>\n<\/div>\n<p><script async src=\"\/\/platform.twitter.com\/widgets.js\" charset=\"utf-8\"><\/script><br \/>\n<br \/><br \/>\n<br \/><a href=\"https:\/\/www.gadgets360.com\/ai\/content-type\/google-titans-ai-architecture-long-term-memory-llm-unveiled-7516611#rss-gadgets-news\">Source link <\/a><\/p>\n","protected":false},"excerpt":{"rendered":"<p>Google researchers unveiled a new artificial intelligence (AI) architecture last week that can enable large language models (LLMs) to remember the long-term context of events and topics. A paper was published by the Mountain View-based tech giant on the topic, and the researchers claim that AI models trained using this architecture displayed a more \u201chuman-like\u201d [&hellip;]<\/p>\n","protected":false},"author":2,"featured_media":219743,"comment_status":"open","ping_status":"open","sticky":false,"template":"","format":"standard","meta":{"tdm_status":"","tdm_grid_status":"","fifu_image_url":"https:\/\/i.gadgets360cdn.com\/large\/google_logo1_reuters_1737368900674.jpg","fifu_image_alt":"","footnotes":""},"categories":[607],"tags":[1274,6915,890,2739,1098,172933,62,16500,17010,1922,1865],"amp_enabled":true,"_links":{"self":[{"href":"https:\/\/news.talkwithrattan.com\/index.php\/wp-json\/wp\/v2\/posts\/219742"}],"collection":[{"href":"https:\/\/news.talkwithrattan.com\/index.php\/wp-json\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/news.talkwithrattan.com\/index.php\/wp-json\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"https:\/\/news.talkwithrattan.com\/index.php\/wp-json\/wp\/v2\/users\/2"}],"replies":[{"embeddable":true,"href":"https:\/\/news.talkwithrattan.com\/index.php\/wp-json\/wp\/v2\/comments?post=219742"}],"version-history":[{"count":1,"href":"https:\/\/news.talkwithrattan.com\/index.php\/wp-json\/wp\/v2\/posts\/219742\/revisions"}],"predecessor-version":[{"id":219744,"href":"https:\/\/news.talkwithrattan.com\/index.php\/wp-json\/wp\/v2\/posts\/219742\/revisions\/219744"}],"wp:featuredmedia":[{"embeddable":true,"href":"https:\/\/news.talkwithrattan.com\/index.php\/wp-json\/wp\/v2\/media\/219743"}],"wp:attachment":[{"href":"https:\/\/news.talkwithrattan.com\/index.php\/wp-json\/wp\/v2\/media?parent=219742"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/news.talkwithrattan.com\/index.php\/wp-json\/wp\/v2\/categories?post=219742"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/news.talkwithrattan.com\/index.php\/wp-json\/wp\/v2\/tags?post=219742"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}