Date & Time:
November 19, 2024 12:30 pm – 1:30 pm
Location:
Crerar 298, 5730 S. Ellis Ave., Chicago, IL,
11/19/2024 12:30 PM 11/19/2024 01:30 PM America/Chicago Lin Tan (Purdue)- LLMs for Code: More Data or More Domain Knowledge? Can They Replace Programmers? Crerar 298, 5730 S. Ellis Ave., Chicago, IL,

Abstract: Recent techniques leverage deep learning techniques, including large language models (LLMs), to improve coding tasks such as code generation, automated program repair, security vulnerability fixing, and binary analysis. An important question is, whether adding more data or more domain knowledge to deep-learning models is a more effective direction to improve LLMs for code. I will discuss existing studies and techniques that answer this question positively or negatively. I will also introduce our code-generation benchmark RepoCod, which answers the question, “Can Language Models Replace Programmers?”, to some extent. RepoCod tasks are real-world, whole-function code generation with repository-level context and contain test cases for validation. Our results show that GPT-4o and other LLMs achieve < 30% pass@1 on RepoCode’s code generation tasks.

https://lt-asset.github.io/REPOCOD/

Speakers

Lin Tan

Mary J. Elmore New Frontiers Professor. Purdue University

Lin Tan is a Mary J. Elmore New Frontiers Professor in the Department of Computer Science at Purdue University. She received her PhD from the University of Illinois, Urbana-Champaign. Prior to joining Purdue, she was a Canada Research Chair and an associate professor at the University of Waterloo. Her research interests include software dependability, software-AI synergy, and software text analytics. Some of her research focuses are leveraging machine learning and natural language processing techniques to improve software dependability, and using software approaches to improve the dependability of machine learning systems. Dr. Tan’s co-authored papers have received ACM Distinguished Paper Awards at CCS 2024, ASE 2020, MSR 2018, and FSE 2016; and IEEE Micro’s Top Picks in 2006. Dr. Tan was a recipient of an Early Career Academic Achievement Alumni Award by the University of Illinois, Urbana-Champaign, Canada Research Chair, an NSERC Discovery Accelerator Supplements Award, an Ontario Early Researcher Award, an Ontario Professional Engineers Award–Engineering Medal for Young Engineer, and multiple industry awards including J.P.Morgan AI Faculty Research Awards, Meta/Facebook Research Awards, Google Faculty Research Awards, and an IBM CAS Research Project of the Year Award. She has served as program co-chair of FSE 2024 (one of the top 2 conferences in software engineering). She was an associate editor of IEEE Transactions on Software Engineering (2017-2022) and Springer Empirical Software Engineering Journal (2015-2021). She was the ACM SIGSOFT Treasurer and an elected Member-at-Large (2021-2024).

Related News & Events

UChicago CS News

UChicago Researchers Receive Google Privacy Faculty Award for Research on AI Privacy Risks

Nov 22, 2024
UChicago CS News

The Climate App Designed to Tackle Chatham’s Flooding Crisis

Nov 21, 2024
In the News

Globus Receives Multiple Honors in 2024 HPCwire Readers’ and Editors’ Choice Awards

Nov 20, 2024
In the News

Argonne Team Breaks New Ground in AI-Driven Protein Design

Nov 15, 2024
UChicago CS News

DOE Awards Fred Chong and his National Research Team $7.5M to Develop a SMART Software Stack to Control Quantum Computer Noise

Nov 12, 2024
UChicago CS News

CS/LSSG Showcases Sustainability Research and Education

Nov 11, 2024
UChicago CS News

Ph.D. Student Jibang Wu Receives the Stigler Center Ph.D. Dissertation Award for His Work Modeling the Incentive Structures of Reward and Recommendation–Based Systems

Oct 24, 2024
UChicago CS News

Rebecca Willett Receives the SIAM Activity Group on Data Science Career Prize

Oct 23, 2024
UChicago CS News

UChicago CS Researchers Shine at UIST 2024 with Papers, Posters, Workshops and Demonstrations

Oct 10, 2024
UChicago CS News

UChicago Scientists Receive Grant to Expand Global Data Management Platform, Globus

Oct 03, 2024
UChicago CS News

UChicago Researchers Demonstrate the Quantifiable Uniqueness of Former President Donald Trump’s Language Use

Sep 30, 2024
UChicago CS News

Five UChicago CS students named to Siebel Scholars class of 2025

Sep 20, 2024
arrow-down-largearrow-left-largearrow-right-large-greyarrow-right-large-yellowarrow-right-largearrow-right-smallbutton-arrowclosedocumentfacebookfacet-arrow-down-whitefacet-arrow-downPage 1CheckedCheckedicon-apple-t5backgroundLayer 1icon-google-t5icon-office365-t5icon-outlook-t5backgroundLayer 1icon-outlookcom-t5backgroundLayer 1icon-yahoo-t5backgroundLayer 1internal-yellowinternalintranetlinkedinlinkoutpauseplaypresentationsearch-bluesearchshareslider-arrow-nextslider-arrow-prevtwittervideoyoutube