Date & Time:
November 19, 2024 12:30 pm – 1:30 pm
Location:
Crerar 298, 5730 S. Ellis Ave., Chicago, IL
Lin Tan (Purdue) - LLMs for Code: More Data or More Domain Knowledge? Can They Replace Programmers?

Abstract: Recent work leverages deep learning, including large language models (LLMs), to improve coding tasks such as code generation, automated program repair, security vulnerability fixing, and binary analysis. An important question is whether adding more data or more domain knowledge to deep-learning models is the more effective direction for improving LLMs for code. I will discuss existing studies and techniques that answer this question positively or negatively. I will also introduce our code-generation benchmark RepoCod, which answers the question, “Can Language Models Replace Programmers?”, to some extent. RepoCod tasks are real-world, whole-function code generation tasks with repository-level context, and they include test cases for validation. Our results show that GPT-4o and other LLMs achieve < 30% pass@1 on RepoCod’s code generation tasks.
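For readers unfamiliar with the pass@1 metric mentioned in the abstract, the following is a minimal sketch of the widely used unbiased pass@k estimator. It is not taken from the talk or from the benchmark's own evaluation code; the function names `pass_at_k` and `mean_pass_at_k` and the sample counts in the usage note are hypothetical and chosen only for illustration.

```python
from math import comb


def pass_at_k(n: int, c: int, k: int) -> float:
    """Estimate pass@k for one task.

    n: number of generated samples for the task
    c: number of those samples that pass all test cases
    k: number of attempts the metric allows
    """
    if not (0 <= c <= n) or not (1 <= k <= n):
        raise ValueError("require 0 <= c <= n and 1 <= k <= n")
    # If fewer than k samples fail, any k-subset contains a passing sample.
    if n - c < k:
        return 1.0
    # 1 - P(all k drawn samples fail), drawing without replacement.
    return 1.0 - comb(n - c, k) / comb(n, k)


def mean_pass_at_k(results: list[tuple[int, int]], k: int) -> float:
    """Average pass@k over tasks; results holds one (n, c) pair per task."""
    if not results:
        raise ValueError("results must not be empty")
    return sum(pass_at_k(n, c, k) for n, c in results) / len(results)
```

With one sample per task (n = 1, k = 1), the benchmark-level pass@1 is simply the fraction of tasks whose single generated function passes its tests. For example, `mean_pass_at_k([(1, 1), (1, 0), (1, 0), (1, 0)], k=1)` averages one solved task over four.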

https://lt-asset.github.io/REPOCOD/

Speakers

Lin Tan

Mary J. Elmore New Frontiers Professor, Purdue University

Lin Tan is a Mary J. Elmore New Frontiers Professor in the Department of Computer Science at Purdue University. She received her PhD from the University of Illinois, Urbana-Champaign. Prior to joining Purdue, she was a Canada Research Chair and an associate professor at the University of Waterloo. Her research interests include software dependability, software-AI synergy, and software text analytics. Her research focuses include leveraging machine learning and natural language processing techniques to improve software dependability, and using software approaches to improve the dependability of machine learning systems. Dr. Tan’s co-authored papers have received ACM Distinguished Paper Awards at CCS 2024, ASE 2020, MSR 2018, and FSE 2016, and were selected for IEEE Micro’s Top Picks in 2006. Dr. Tan has received an Early Career Academic Achievement Alumni Award from the University of Illinois, Urbana-Champaign, a Canada Research Chair, an NSERC Discovery Accelerator Supplements Award, an Ontario Early Researcher Award, an Ontario Professional Engineers Award–Engineering Medal for Young Engineer, and multiple industry awards, including J.P. Morgan AI Faculty Research Awards, Meta/Facebook Research Awards, Google Faculty Research Awards, and an IBM CAS Research Project of the Year Award. She served as program co-chair of FSE 2024 (one of the top two conferences in software engineering). She was an associate editor of IEEE Transactions on Software Engineering (2017-2022) and of the Springer Empirical Software Engineering Journal (2015-2021). She was the ACM SIGSOFT Treasurer and an elected Member-at-Large (2021-2024).
