{"id":27694,"date":"2024-01-18T08:10:20","date_gmt":"2024-01-18T13:10:20","guid":{"rendered":"https:\/\/michigan.it.umich.edu\/news\/?p=27694"},"modified":"2024-07-08T06:04:13","modified_gmt":"2024-07-08T10:04:13","slug":"using-natural-language-processing-to-improve-everyday-life","status":"publish","type":"post","link":"https:\/\/michigan.it.umich.edu\/news\/2024\/01\/18\/using-natural-language-processing-to-improve-everyday-life\/","title":{"rendered":"Using natural language processing to improve everyday life"},"content":{"rendered":"<div class=\"wp-block-image\">\n<figure class=\"aligncenter size-large\"><img loading=\"lazy\" decoding=\"async\" width=\"600\" height=\"398\" src=\"https:\/\/michigan.it.umich.edu\/news\/wp-content\/uploads\/2024\/01\/joyce-chai-featured-600x398.jpg\" alt=\"Joyce Chai\" class=\"wp-image-27705\" srcset=\"https:\/\/michigan.it.umich.edu\/news\/wp-content\/uploads\/2024\/01\/joyce-chai-featured-600x398.jpg 600w, https:\/\/michigan.it.umich.edu\/news\/wp-content\/uploads\/2024\/01\/joyce-chai-featured-300x199.jpg 300w, https:\/\/michigan.it.umich.edu\/news\/wp-content\/uploads\/2024\/01\/joyce-chai-featured-400x266.jpg 400w, https:\/\/michigan.it.umich.edu\/news\/wp-content\/uploads\/2024\/01\/joyce-chai-featured-768x510.jpg 768w, https:\/\/michigan.it.umich.edu\/news\/wp-content\/uploads\/2024\/01\/joyce-chai-featured-665x442.jpg 665w, https:\/\/michigan.it.umich.edu\/news\/wp-content\/uploads\/2024\/01\/joyce-chai-featured.jpg 1500w\" sizes=\"auto, (max-width: 600px) 100vw, 600px\" \/><figcaption class=\"wp-element-caption\">Joyce Chai (image courtesy U-M College of Engineering, Computer Science and Engineering)<\/figcaption><\/figure>\n<\/div>\n\n\n<p><a href=\"https:\/\/web.eecs.umich.edu\/~chaijy\/\">Joyce Y. Chai<\/a>, professor of electrical engineering and computer science, College of Engineering, and colleagues have been seeking answers to complex questions using natural language processing and machine learning that may improve everyday life.<\/p>\n\n\n\n<p>Some of the algorithms that they develop in their work are meant for tasks that machines may have little to no prior knowledge of. For example, to guide human users to gain a particular skill (e.g., building a special apparatus or even, \u201cTell me how to <a href=\"https:\/\/arxiv.org\/abs\/2311.00738\">bake a cake<\/a>\u201d). A set of instructions based on the observation of what the user is doing, e.g., to correct mistakes or provide the next step, would be generated by Generative AI, or <a href=\"https:\/\/genai.umich.edu\/\">GenAI<\/a>. The better the data and engineering behind the AI, the more useful the instructions will be.&nbsp;<\/p>\n\n\n\n<p>\u201cTo enable machines to quickly learn and adapt to a new task, developers may give a few examples of recipe steps with both language instructions and video demonstrations. Machines can then (hopefully) guide users through the task by recognizing the right steps and generating relevant instructions using GenAI,\u201d said Chai.<\/p>\n\n\n<div class=\"wp-block-image\">\n<figure class=\"aligncenter size-medium\"><img loading=\"lazy\" decoding=\"async\" width=\"300\" height=\"294\" src=\"https:\/\/michigan.it.umich.edu\/news\/wp-content\/uploads\/2024\/01\/AI-ML-DeepLearning-300x294.png\" alt=\"\" class=\"wp-image-27695\" srcset=\"https:\/\/michigan.it.umich.edu\/news\/wp-content\/uploads\/2024\/01\/AI-ML-DeepLearning-300x294.png 300w, https:\/\/michigan.it.umich.edu\/news\/wp-content\/uploads\/2024\/01\/AI-ML-DeepLearning-600x589.png 600w, https:\/\/michigan.it.umich.edu\/news\/wp-content\/uploads\/2024\/01\/AI-ML-DeepLearning-768x754.png 768w, https:\/\/michigan.it.umich.edu\/news\/wp-content\/uploads\/2024\/01\/AI-ML-DeepLearning-665x653.png 665w, https:\/\/michigan.it.umich.edu\/news\/wp-content\/uploads\/2024\/01\/AI-ML-DeepLearning.png 1090w\" sizes=\"auto, (max-width: 300px) 100vw, 300px\" \/><figcaption class=\"wp-element-caption\"><a href=\"https:\/\/commons.wikimedia.org\/w\/index.php?curid=122023216\">By Lollixzc &#8211; Own work, CC BY-SA 4.0<\/a><\/figcaption><\/figure>\n<\/div>\n\n\n<h2 class=\"wp-block-heading\">What are AI, machine learning, deep learning, and natural language processing?&nbsp;<\/h2>\n\n\n\n<p>It might help to take a step back to understand AI, machine learning (ML), and deep learning at a high level. Both ML and deep learning are subsets of AI, as seen in the figure. Some natural language processing (NLP) tasks fall within the realm of deep learning. They all work together and build off of each other.<\/p>\n\n\n\n<p>Artificial Intelligence, or AI, is a branch of computer science that attempts to simulate human intelligence with computers. It involves creating systems to perform tasks that usually need human intelligence, such as visual perception, speech recognition, decision-making, and translation between languages.<\/p>\n\n\n\n<p>\u201cNLP is a sub-area in AI, where DL\/ML approaches are predominantly applied,\u201d stated Chai.<\/p>\n\n\n\n<p>Christopher Brown, a research data scientist, with <a href=\"https:\/\/arc.umich.edu\">ITS Advanced Research Computing<\/a> (ARC) and a member of the <a href=\"https:\/\/arc.umich.edu\/consulting\/\">ARC consulting team<\/a>, explains that ML is a subfield of AI. Within ML, algorithms are used to generalize situations beyond those seen in training data and then complete tasks without further guidance from people. A good example is <a href=\"https:\/\/genai.umich.edu\/\">U-M GPT<\/a>. The large language models (LLMs) accessible via U-M GPT are trained with millions of diverse examples. \u201cThe goal is to train the models to reliably predict, translate, or generate something.\u201d&nbsp;<\/p>\n\n\n\n<p>\u201cData is any information that can be formatted and fed into an algorithm that can be used for some task, including journal articles, chats, numbers, videos, audio, and texts,\u201d said Brown, Algorithms can be trained to perform tasks using these real-world data.&nbsp;<\/p>\n\n\n\n<p>Natural Language Processing is a branch of artificial intelligence that helps computers understand and generate human language in a way that is both meaningful and useful to humans. NLP teaches computers to understand languages and then respond so that humans can understand, and even accounting for when rich context language is used.&nbsp;<\/p>\n\n\n\n<p>\u201cNLP is highly interdisciplinary, and involves multiple fields, such as computer science, linguistics, philosophy, cognitive science, statistics, mathematics, etc.,\u201d said Chai.<\/p>\n\n\n\n<p>Examples of NLP are everywhere: when you ask Siri for directions, or when Google efficiently completes your half-typed query, or even when you get suggested replies in your email.&nbsp;<\/p>\n\n\n\n<p>Ultimately NLP, along with AI, can be used to make interactions between humans and machines as natural and as easy as possible.&nbsp;<\/p>\n\n\n\n<h2 class=\"wp-block-heading\">A lot of data is needed to train the models<\/h2>\n\n\n\n<p>Dr. Chai and her team use large language models, a lot of data, and computing resources. These models take longer to train and are harder to interpret. Brown says, \u201cThe state of the art, groundbreaking work tends to be in this area.\u201d&nbsp;<\/p>\n\n\n\n<p>Dr. Chai uses deep learning algorithms that make predictions about what the next part of the task or conversation is. \u201cFor example, they use deep learning and the transformer architecture to enable embodied agents to learn how new words are connected to the physical environment, to follow human language instructions, and to collaborate with humans to come up with a shared plan,\u201d Brown explains.<\/p>\n\n\n\n<h2 class=\"wp-block-heading\">The technology that supports this work<\/h2>\n\n\n\n<p>To accomplish her work, Dr. Chai uses the <a href=\"https:\/\/arc.umich.edu\/greatlakes\/\">Great Lakes High-Performance Computing Cluster<\/a> and <a href=\"https:\/\/arc.umich.edu\/turbo\/\">Turbo Research Storage<\/a>, both of which are managed by U-M\u2019s Advanced Research Computing Group (ARC) in Information and Technology Services. She has 16 GPUs on Great Lakes at the ready, with the option to use more at any given time.&nbsp;<\/p>\n\n\n\n<p>A GPU, or Graphics Processing Unit, is a piece of computer equipment that is good at displaying pictures, animations, and videos on your screen. The GPU is especially adept at quickly creating and manipulating images. Traditionally, GPUs were used for video games and professional design software where detailed graphics were necessary. But more recently, researchers including Dr. Chai discovered that GPUs are also good at handling many simple tasks at the same time. This includes tasks like scientific simulations and AI training where a lot of calculations need to be done in parallel (which is perfect for training large language models).<\/p>\n\n\n\n<p>\u201cGPUs are popular for deep learning, and we will continue to get more and better GPUs in the future. There is a demand, and we will continue supporting this technology so that deep learning can continue to grow,\u201d said Brock Palen, ITS Advanced Research Computing director.&nbsp;<\/p>\n\n\n\n<p>Chai and her team also leveraged 29 terabytes of the <a href=\"https:\/\/arc.umich.edu\/turbo\/\">Turbo Research Storage<\/a> service at ARC. NLP benefits from the high-capacity, reliable, secure, and fast storage solution. Turbo enables investigators across the university to store and access data needed for their research via Great Lakes.&nbsp;<\/p>\n\n\n\n<h2 class=\"wp-block-heading\">Great Lakes HPC in the classroom&nbsp;<\/h2>\n\n\n\n<p>ARC offers <a href=\"https:\/\/arc.umich.edu\/greatlakes\/course-accounts\/\">classroom use<\/a> of high-performance computing cluster resources on the <a href=\"https:\/\/arc.umich.edu\/greatlakes\/\">Great Lakes High-Performance Computing Cluster<\/a>.&nbsp;<\/p>\n\n\n\n<p>Dr. Chai regularly leverages this resource. \u201cOver 300 students have benefited from this experience. We have homework that requires the use of the Great Lakes, e.g., having students learn how to conduct experiments in a managed job-scheduling system like SLURM. This will benefit them in the future if they engage in any compute-intensive R&amp;D (research and development).<\/p>\n\n\n\n<p>\u201cFor my NLP class, I request Great Lakes access for my students so they have the ability to develop some meaningful final projects. We also use the Great Lakes HPC resources <a href=\"https:\/\/arxiv.org\/pdf\/2305.16579.pdf\">to study the reproducibility for NLP beginners<\/a>,\u201d said Chai. A <a href=\"https:\/\/sled.eecs.umich.edu\/teaching\/\">gallery<\/a> is available for many of the student projects.<\/p>\n\n\n\n<h2 class=\"wp-block-heading\">The UMRCP defrays costs<\/h2>\n\n\n\n<p>The <a href=\"https:\/\/arc.umich.edu\/umrcp\/\">U-M Research Computing Package<\/a> is a set of cost-sharing allocations offered by ITS ARC, and they are available upon request. Other units offer additional cost-sharing to researchers. Chai said, \u201cWe typically use the nodes owned by my group for research projects that require intensive, large-scale GPU model training. We use the UMRCP for less intensive tasks, thereby extending the budgetary impact of the allocations.\u201d&nbsp;<\/p>\n","protected":false},"excerpt":{"rendered":"<p>Joyce Y. Chai, professor of electrical engineering and computer science, College of Engineering, and colleagues have been seeking answers to complex questions using natural language processing and machine learning that may improve everyday life.<\/p>\n","protected":false},"author":50,"featured_media":27705,"comment_status":"open","ping_status":"closed","sticky":false,"template":"","format":"standard","meta":{"_uag_custom_page_level_css":"","_umich_oidc_access":"","_monsterinsights_skip_tracking":false,"_monsterinsights_sitenote_active":false,"_monsterinsights_sitenote_note":"","_monsterinsights_sitenote_category":0,"_ef_editorial_meta_date_first-draft-date":"","_ef_editorial_meta_paragraph_assignment":"","_ef_editorial_meta_checkbox_needs-photo":"","_ef_editorial_meta_number_word-count":"","footnotes":""},"categories":[4],"tags":[852,254,742,130],"class_list":["post-27694","post","type-post","status-publish","format-standard","has-post-thumbnail","hentry","category-features","tag-engineering","tag-hpc","tag-machine-learning","tag-research"],"uagb_featured_image_src":{"full":["https:\/\/michigan.it.umich.edu\/news\/wp-content\/uploads\/2024\/01\/joyce-chai-featured.jpg",1500,996,false],"thumbnail":["https:\/\/michigan.it.umich.edu\/news\/wp-content\/uploads\/2024\/01\/joyce-chai-featured-400x266.jpg",400,266,true],"medium":["https:\/\/michigan.it.umich.edu\/news\/wp-content\/uploads\/2024\/01\/joyce-chai-featured-300x199.jpg",300,199,true],"medium_large":["https:\/\/michigan.it.umich.edu\/news\/wp-content\/uploads\/2024\/01\/joyce-chai-featured-768x510.jpg",665,442,true],"large":["https:\/\/michigan.it.umich.edu\/news\/wp-content\/uploads\/2024\/01\/joyce-chai-featured-600x398.jpg",600,398,true],"1536x1536":["https:\/\/michigan.it.umich.edu\/news\/wp-content\/uploads\/2024\/01\/joyce-chai-featured.jpg",1500,996,false],"2048x2048":["https:\/\/michigan.it.umich.edu\/news\/wp-content\/uploads\/2024\/01\/joyce-chai-featured.jpg",1500,996,false],"excerpt-thumbnail":["https:\/\/michigan.it.umich.edu\/news\/wp-content\/uploads\/2024\/01\/joyce-chai-featured-200x140.jpg",200,140,true],"themonic-thumbnail":["https:\/\/michigan.it.umich.edu\/news\/wp-content\/uploads\/2024\/01\/joyce-chai-featured-60x42.jpg",60,42,true],"ioslider-thumbnail":["https:\/\/michigan.it.umich.edu\/news\/wp-content\/uploads\/2024\/01\/joyce-chai-featured-658x300.jpg",658,300,true],"post-thumbnail":["https:\/\/michigan.it.umich.edu\/news\/wp-content\/uploads\/2024\/01\/joyce-chai-featured-665x442.jpg",665,442,true],"400x250-crop":["https:\/\/michigan.it.umich.edu\/news\/wp-content\/uploads\/2024\/01\/joyce-chai-featured-400x250.jpg",400,250,true]},"uagb_author_info":{"display_name":"D. Stephanie Dascola, Medical School Office of Research","author_link":"https:\/\/michigan.it.umich.edu\/news\/author\/sdascola\/"},"uagb_comment_info":0,"uagb_excerpt":"Joyce Y. Chai, professor of electrical engineering and computer science, College of Engineering, and colleagues have been seeking answers to complex questions using natural language processing and machine learning that may improve everyday life.","_links":{"self":[{"href":"https:\/\/michigan.it.umich.edu\/news\/wp-json\/wp\/v2\/posts\/27694","targetHints":{"allow":["GET"]}}],"collection":[{"href":"https:\/\/michigan.it.umich.edu\/news\/wp-json\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/michigan.it.umich.edu\/news\/wp-json\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"https:\/\/michigan.it.umich.edu\/news\/wp-json\/wp\/v2\/users\/50"}],"replies":[{"embeddable":true,"href":"https:\/\/michigan.it.umich.edu\/news\/wp-json\/wp\/v2\/comments?post=27694"}],"version-history":[{"count":5,"href":"https:\/\/michigan.it.umich.edu\/news\/wp-json\/wp\/v2\/posts\/27694\/revisions"}],"predecessor-version":[{"id":27707,"href":"https:\/\/michigan.it.umich.edu\/news\/wp-json\/wp\/v2\/posts\/27694\/revisions\/27707"}],"wp:featuredmedia":[{"embeddable":true,"href":"https:\/\/michigan.it.umich.edu\/news\/wp-json\/wp\/v2\/media\/27705"}],"wp:attachment":[{"href":"https:\/\/michigan.it.umich.edu\/news\/wp-json\/wp\/v2\/media?parent=27694"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/michigan.it.umich.edu\/news\/wp-json\/wp\/v2\/categories?post=27694"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/michigan.it.umich.edu\/news\/wp-json\/wp\/v2\/tags?post=27694"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}