{"id":53301,"date":"2025-08-05T17:08:31","date_gmt":"2025-08-05T21:08:31","guid":{"rendered":"https:\/\/engineering.jhu.edu\/ams\/?post_type=news&#038;p=53301"},"modified":"2025-09-17T13:32:12","modified_gmt":"2025-09-17T17:32:12","slug":"charting-new-directions-in-statistical-deep-learning-and-ai-meet-soufiane-hayou","status":"publish","type":"news","link":"https:\/\/engineering.jhu.edu\/ams\/news\/charting-new-directions-in-statistical-deep-learning-and-ai-meet-soufiane-hayou\/","title":{"rendered":"Charting new directions in statistical deep learning and AI: Meet Soufiane Hayou\u00a0"},"content":{"rendered":"<p><span data-contrast=\"auto\"><a href=\"https:\/\/engineering.jhu.edu\/ams\/faculty\/soufiane-hayou\/\">Soufiane Hayou<\/a>\u00a0joined the <a href=\"https:\/\/engineering.jhu.edu\/ams\/\">Department of Applied Mathematics and Statistics<\/a>, as well as the <a href=\"https:\/\/ai.jhu.edu\">Data Science and AI Institute<\/a> (DSAI), on August 1. Prior to joining Hopkins, he was a <\/span><span data-contrast=\"none\">Peng Tsu Ann Assistant Professor at <\/span><span data-contrast=\"auto\">the <\/span><span data-contrast=\"none\">National University of Singapore and a researcher at the Simons Institute for the Theory of Computing at UC Berkeley. His research explores the mathematical foundations of deep learning, with an emphasis on uncovering how large neural networks behave and scale. <\/span><span data-ccp-props=\"{&quot;201341983&quot;:0,&quot;335559740&quot;:276}\">\u00a0<\/span><\/p>\n<p><span data-ccp-props=\"{}\">\u00a0<\/span><\/p>\n<p><strong>Tell us a little about yourself.\u202f\u202f\u00a0<\/strong><\/p>\n<p><span data-contrast=\"none\">I\u2019m originally from Morocco and grew up in Khenifra, a small town in the Atlas Mountains. After high school and two years of intensive preparatory courses (classes pr\u00e9paratoires) for Frances elite universities, I was admitted to \u00c9cole Polytechnique, where I earned an engineering degree and a master\u2019s in applied mathematics. I also completed a master\u2019s in mathematical finance at Pierre et Marie Curie University in Paris. I then moved to the UK for a PhD in Statistics and Machine Learning at the University of Oxford. Afterward, I joined the National University of Singapore as a Peng Tsu Ann Assistant Professor of Mathematics, followed by two years at the Simons Institute for the Theory of Computing at UC Berkeley. Outside of work, I enjoy playing football (soccer), watching movies, and traveling.<\/span><span data-ccp-props=\"{&quot;201341983&quot;:0,&quot;335559740&quot;:276}\">\u00a0<\/span><\/p>\n<p><strong>Describe your research.\u202f\u202f\u00a0<\/strong><\/p>\n<p><span data-contrast=\"none\">My work lies at the intersection of theory and application, where I use mathematical tools to study the behavior of large-scale neural networks and develop principled methods to improve their training and deployment. Lately, my focus has been on enhancing the efficiency of training, fine-tuning, and inference in large language models. I aim to design Pareto-optimal approaches that span the entire lifecycle of these models\u2014from pre-training to deployment\u2014and ultimately apply these ideas to more general AI systems. I&#8217;m continually drawn to the interplay between mathematics and artificial intelligence and plan to explore this direction throughout my career.<\/span><span data-ccp-props=\"{&quot;201341983&quot;:0,&quot;335559740&quot;:276}\">\u00a0<\/span><\/p>\n<p><strong>What are some real-world applications of your research?\u202f\u00a0<\/strong><\/p>\n<p><span data-contrast=\"auto\">I develop techniques to make AI systems smarter and more efficient. AI is changing how we work and live\u2013from summarizing documents to powering complex systems that understand images, speech, and text all at once.\u00a0 These models are being adopted at an unprecedented rate, and we&#8217;re only beginning to see their economic impact. To be useful, these systems typically go through two key stages: large-scale pre-training on vast datasets, followed by task-specific adaptation through post-training. My research spans both phases. On the pre-training side, I\u2019ve worked on depth parametrization techniques like Stable ResNet and Depth Hyperparameter Transfer, which offer efficient ways to scale neural networks by increasing their depth. In the post-training phase, I developed LoRA+, an extension of the LoRA method for lightweight fine-tuning of large models. These techniques improve the adaptability of language and vision models while significantly boosting efficiency for downstream tasks.<\/span><span data-ccp-props=\"{&quot;201341983&quot;:0,&quot;335559740&quot;:276}\">\u00a0<\/span><span data-ccp-props=\"{}\">\u00a0<\/span><\/p>\n<p><strong>What drew you to this field and focus area?\u202f\u202f\u00a0<\/strong><\/p>\n<p><span data-contrast=\"auto\">I&#8217;ve long been fascinated by studying patterns that emerge when things are pushed to extremes. This mathematical approach offers a powerful way to tackle real0world problems, particularly those dealing with uncertainty. <\/span><span data-contrast=\"none\">This curiosity led me to study probability theory and high-dimensional statistics at \u00c9cole Polytechnique. During that time, I interned as a quantitative researcher at a major investment bank and began exploring deep learning. I quickly noticed that much of model development relied on trial and error, whereas mathematical analysis could offer more principled guidance. Viewing large neural networks as functions of random variables opens the door to rigorous study of their behavior. This realization led me to pursue a PhD in statistical deep learning, with a focus on mathematically grounded methods for training large-scale models. While statistics help address the data side, tools from applied mathematics\u2014such as dynamical systems, stochastic processes, random matrix theory, and PDEs\u2014are essential for understanding model dynamics at scale.<\/span><span data-ccp-props=\"{&quot;201341983&quot;:0,&quot;335559740&quot;:276}\">\u00a0<\/span><\/p>\n<p><strong>What excites you about bringing this work to Johns Hopkins?\u202f\u202f\u00a0<\/strong><\/p>\n<p><span data-contrast=\"auto\">I&#8217;m thrilled to be joining Johns Hopkins University, a leading interdisciplinary research institution with an outstanding reputation across multiple scientific disciplines. The newly established Data Science and AI Institute (DSAI) is an example of the university&#8217;s commitment to advancing AI research, alongside other institutes like the Mathematical Institute for Data Science and the SNF Agora Institute. Working at the intersection of theory and applications, I see DSAI as an ideal environment for fostering collaborations with colleagues throughout the Whiting School of Engineering. I&#8217;m equally excited about joining the AMS department, which hosts a dynamic community of exceptional researchers across diverse fields. Another compelling aspect is the opportunity to develop practical AI tools, particularly in health care.\u00a0<\/span><span data-ccp-props=\"{&quot;335559738&quot;:240,&quot;335559739&quot;:240}\">\u00a0<\/span><\/p>\n","protected":false},"template":"","class_list":["post-53301","news","type-news","status-publish","hentry","news_categories-applied-mathematics","news_categories-data-science","news_categories-department-news"],"acf":[],"yoast_head":"<!-- This site is optimized with the Yoast SEO plugin v27.7 - https:\/\/yoast.com\/product\/yoast-seo-wordpress\/ -->\n<title>Charting new directions in statistical deep learning and AI: Meet Soufiane Hayou\u00a0 | Department of Applied Mathematics and Statistics<\/title>\n<meta name=\"robots\" content=\"index, follow, max-snippet:-1, max-image-preview:large, max-video-preview:-1\" \/>\n<link rel=\"canonical\" href=\"https:\/\/engineering.jhu.edu\/ams\/news\/charting-new-directions-in-statistical-deep-learning-and-ai-meet-soufiane-hayou\/\" \/>\n<meta property=\"og:locale\" content=\"en_US\" \/>\n<meta property=\"og:type\" content=\"article\" \/>\n<meta property=\"og:title\" content=\"Charting new directions in statistical deep learning and AI: Meet Soufiane Hayou\u00a0 | Department of Applied Mathematics and Statistics\" \/>\n<meta property=\"og:description\" content=\"Soufiane Hayou\u00a0joined the Department of Applied Mathematics and Statistics, as well as the Data Science and AI Institute (DSAI), on August 1. Prior to joining Hopkins, he was a Peng&hellip;\" \/>\n<meta property=\"og:url\" content=\"https:\/\/engineering.jhu.edu\/ams\/news\/charting-new-directions-in-statistical-deep-learning-and-ai-meet-soufiane-hayou\/\" \/>\n<meta property=\"og:site_name\" content=\"Department of Applied Mathematics and Statistics\" \/>\n<meta property=\"article:modified_time\" content=\"2025-09-17T17:32:12+00:00\" \/>\n<meta name=\"twitter:card\" content=\"summary_large_image\" \/>\n<meta name=\"twitter:label1\" content=\"Est. reading time\" \/>\n\t<meta name=\"twitter:data1\" content=\"4 minutes\" \/>\n<!-- \/ Yoast SEO plugin. -->","yoast_head_json":{"title":"Charting new directions in statistical deep learning and AI: Meet Soufiane Hayou\u00a0 | Department of Applied Mathematics and Statistics","robots":{"index":"index","follow":"follow","max-snippet":"max-snippet:-1","max-image-preview":"max-image-preview:large","max-video-preview":"max-video-preview:-1"},"canonical":"https:\/\/engineering.jhu.edu\/ams\/news\/charting-new-directions-in-statistical-deep-learning-and-ai-meet-soufiane-hayou\/","og_locale":"en_US","og_type":"article","og_title":"Charting new directions in statistical deep learning and AI: Meet Soufiane Hayou\u00a0 | Department of Applied Mathematics and Statistics","og_description":"Soufiane Hayou\u00a0joined the Department of Applied Mathematics and Statistics, as well as the Data Science and AI Institute (DSAI), on August 1. Prior to joining Hopkins, he was a Peng&hellip;","og_url":"https:\/\/engineering.jhu.edu\/ams\/news\/charting-new-directions-in-statistical-deep-learning-and-ai-meet-soufiane-hayou\/","og_site_name":"Department of Applied Mathematics and Statistics","article_modified_time":"2025-09-17T17:32:12+00:00","twitter_card":"summary_large_image","twitter_misc":{"Est. reading time":"4 minutes"},"schema":{"@context":"https:\/\/schema.org","@graph":[{"@type":"WebPage","@id":"https:\/\/engineering.jhu.edu\/ams\/news\/charting-new-directions-in-statistical-deep-learning-and-ai-meet-soufiane-hayou\/","url":"https:\/\/engineering.jhu.edu\/ams\/news\/charting-new-directions-in-statistical-deep-learning-and-ai-meet-soufiane-hayou\/","name":"Charting new directions in statistical deep learning and AI: Meet Soufiane Hayou\u00a0 | Department of Applied Mathematics and Statistics","isPartOf":{"@id":"https:\/\/engineering.jhu.edu\/ams\/#website"},"datePublished":"2025-08-05T21:08:31+00:00","dateModified":"2025-09-17T17:32:12+00:00","breadcrumb":{"@id":"https:\/\/engineering.jhu.edu\/ams\/news\/charting-new-directions-in-statistical-deep-learning-and-ai-meet-soufiane-hayou\/#breadcrumb"},"inLanguage":"en-US","potentialAction":[{"@type":"ReadAction","target":["https:\/\/engineering.jhu.edu\/ams\/news\/charting-new-directions-in-statistical-deep-learning-and-ai-meet-soufiane-hayou\/"]}]},{"@type":"BreadcrumbList","@id":"https:\/\/engineering.jhu.edu\/ams\/news\/charting-new-directions-in-statistical-deep-learning-and-ai-meet-soufiane-hayou\/#breadcrumb","itemListElement":[{"@type":"ListItem","position":1,"name":"Home","item":"https:\/\/engineering.jhu.edu\/ams\/"},{"@type":"ListItem","position":2,"name":"News","item":"https:\/\/engineering.jhu.edu\/ams\/news\/"},{"@type":"ListItem","position":3,"name":"Charting new directions in statistical deep learning and AI: Meet Soufiane Hayou\u00a0"}]},{"@type":"WebSite","@id":"https:\/\/engineering.jhu.edu\/ams\/#website","url":"https:\/\/engineering.jhu.edu\/ams\/","name":"Hopkins Applied Math & Statistics","description":"Department of Applied Mathematics and Statistics","potentialAction":[{"@type":"SearchAction","target":{"@type":"EntryPoint","urlTemplate":"https:\/\/engineering.jhu.edu\/ams\/?s={search_term_string}"},"query-input":{"@type":"PropertyValueSpecification","valueRequired":true,"valueName":"search_term_string"}}],"inLanguage":"en-US"}]}},"distributor_meta":false,"distributor_terms":false,"distributor_media":false,"distributor_original_site_name":"Department of Applied Mathematics and Statistics","distributor_original_site_url":"https:\/\/engineering.jhu.edu\/ams","push-errors":false,"_links":{"self":[{"href":"https:\/\/engineering.jhu.edu\/ams\/wp-json\/wp\/v2\/news\/53301","targetHints":{"allow":["GET"]}}],"collection":[{"href":"https:\/\/engineering.jhu.edu\/ams\/wp-json\/wp\/v2\/news"}],"about":[{"href":"https:\/\/engineering.jhu.edu\/ams\/wp-json\/wp\/v2\/types\/news"}],"wp:attachment":[{"href":"https:\/\/engineering.jhu.edu\/ams\/wp-json\/wp\/v2\/media?parent=53301"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}