Home | Portfolio | Beyond Work

Mahmoud Jahanshahi

Ph.D. in Computer Science

Hi, welcome to my page! I’m an AI consultant and faculty member with a Ph.D. in Computer Science, teaching at the University of Tennessee’s College of Emerging and Collaborative Studies. My work bridges academic research and real-world applications, with a focus on mining large-scale, complex data to uncover patterns and drive solutions.

I’ve developed ecosystem-scale tools for software analysis, curated safer datasets for large language models, and delivered insights that inform licensing, security, and data governance. My background spans academia, business intelligence, finance, and strategic decision-making, enabling me to connect technical depth with business outcomes.

Whether through research, teaching, or consulting, I help organizations tackle data challenges, align AI strategies with their goals, and make confident, forward-looking decisions at the intersection of data, technology, and innovation.

When I’m not immersed in work, I enjoy playing the piano or pushing my limits with CrossFit. You can learn more about my interests here.


Socials:


Experience

Teaching Assistant Professor

University of Tennessee, Knoxville, USA

Teaching at the College of Emerging and Collaborative Studies. Current courses include Introduction to the World of AI, Data Knowledge & Discovery, Data Wrangling, and Transdisciplinary Problem Solving.

August 2025 - Present

Graduate Research Assistant

University of Tennessee, Knoxville, USA

Conducting research on Open Source Software supply chains through repository mining, with a focus on copy-based reuse and its implications.

May 2021 - May 2025

Business Intelligence Consultant

Freelance

Providing specialized Business Intelligence consulting services, focusing on financial data analysis and strategic insights across diverse industries.

April 2020 - May 2021

Senior Data Scientist

Mobile Communications Company of Iran (Hamrahe Aval), Tehran, Iran

Analyzing business processes, data, and reporting needs. Developing and maintaining reports, dashboards, and analyses using Oracle Business Intelligence Enterprise Edition (OBIEE). Collaborating with the data warehouse team on requirements gathering, design, testing, and ongoing development.

May 2019 - April 2020

Strategic Investments Lead

Mobile Communications Company of Iran (Hamrahe Aval), Tehran, Iran

Managing a team of 5 professionals in handling complex investment projects, interfacing with C-level management, negotiating cooperation models and contract terms, and supporting the development of investment strategies and policies.

February 2018 - May 2019

International Investment Analyst

Mobile Communications Company of Iran (Hamrahe Aval), Tehran, Iran

Using financial models to project earnings, screening markets for acquisition opportunities, predicting market events, and interpreting financial statements.

February 2016 - February 2018

Invited Talks

  • Invited Speaker: "Hidden Vulnerabilities and Licensing Risks in LLM Pre-Training Datasets", at the 1st International Workshop on Large Language Model Supply Chain Analysis (LLMSC), held in conjuction with the ACM SIGSOFT International Symposium on Software Testing and Analysis (ISSTA), Trondheim, Norway, June 2025.
    Presentation

Publications ORCID

  • M. Jahanshahi and A. Mockus, "Cracks in The Stack: Hidden Vulnerabilities and Licensing Risks in LLM Pre-Training Datasets," 2025 IEEE/ACM International Workshop on Large Language Models for Code (LLM4Code), Ottawa, ON, Canada, 2025, pp. 104-111, doi: 10.1109/LLM4Code66737.2025.00018.
    Won the LLM4Code Best Paper Award.
    Paper - Replication Package - GitHub Repo
  • M. Jahanshahi, D. Reid and A. Mockus, "Beyond Dependencies: The Role of Copy-Based Reuse in Open Source Software Development," ACM Transactions on Software Engineering and Methodology (TOSEM), New York, NY, USA, 2025, Just Accepted, doi: 10.1145/3715907.
    Paper - Replication Package - GitHub Repo
  • M. Jahanshahi, D. Reid, A. McDaniel and A. Mockus, "OSS License Identification at Scale: A Comprehensive Dataset Using World of Code," 2025 IEEE/ACM 22nd International Conference on Mining Software Repositories (MSR), Ottawa, ON, Canada, 2025, pp. 144-148, doi: 10.1109/MSR66628.2025.00032.
    Paper - Replication Package
  • C. Miller, M. Jahanshahi, A. Mockus, B. Vasilescu and C. Kastner, "Understanding the Response to Open-Source Dependency Abandonment in the npm Ecosystem," 2025 IEEE/ACM 47th International Conference on Software Engineering (ICSE), Ottawa, ON, Canada, 2025, pp. 2355-2367, doi: 10.1109/ICSE55347.2025.00004.
    Won the ACM SIGSOFT Distinguished Paper Award.
    Paper - Replication Package - GitHub Repo
  • A. M. Thakur, R. Milewicz, M. Jahanshahi, L. Paganini, B. Vasilescu, and A. Mockus, "Scientific Open-Source Software Is Less Likely to Become Abandoned Than One Might Think! Lessons from Curating a Catalog of Maintained Scientific Software," Proceedings of the ACM on Software Engineering, Vol. 2, No. FSE, New York, NY, USA, July 2025, Art. No. FSE099, pp. 2216-2239, doi: 10.1145/3729369.
    Paper - Replication Package
  • M. Jahanshahi and A. Mockus, "Dataset: Copy-based Reuse in Open Source Software," 2024 IEEE/ACM 21st International Conference on Mining Software Repositories (MSR), Lisbon, Portugal, 2024, pp. 42-47, doi: 10.1145/3643991.3644868.
    Paper - GitHub Repo
  • D. Reid, M. Jahanshahi and A. Mockus, "The Extent of Orphan Vulnerabilities from Code Reuse in Open Source Software," 2022 IEEE/ACM 44th International Conference on Software Engineering (ICSE), Pittsburgh, PA, USA, 2022, pp. 2104-2115, doi: 10.1145/3510003.3510216.
    Nominated for the ACM SIGSOFT Distinguished Paper Award.
    Paper - GitHub Repo
  • E. Lyulina and M. Jahanshahi, "Building the Collaboration Graph of Open-Source Software Ecosystem," 2021 IEEE/ACM 18th International Conference on Mining Software Repositories (MSR), Madrid, Spain, 2021, pp. 618-620, doi: 10.1109/MSR52588.2021.00086.
    Paper - GitHub Repo

Review Roles

Journals


Computers & Security (COSE)
ISSN: 0167-4048
Since 2025
ACM Transactions on Software Engineering and Methodology (TOSEM)
ISSN: 1557-7392
Since 2024
Empirical Software Engineering (EMSE)
ISSN: 1382-3256
Since 2022

Conferences


23rd International Conference on Mining Software Repositories (MSR)
Data and Tool Showcase Track
April 2026
2nd ACM International Conference on AI-powered Software (AIware)
Main Track
November 2025
20th International Conference on Mining Software Repositories (MSR)
Mining Challenge
May 2023

Awards

LLM4Code Best Paper Award

Paper: “Cracks in the Stack: Hidden Vulnerabilities and Licensing Risks in LLM Pre-Training Datasets”
Venue: Second International Workshop on Large Language Models for Code (LLM4Code), 2025
May 2025

ACM SIGSOFT Distinguished Paper Award

Paper: “Understanding the Response to Open-Source Dependency Abandonment in the npm Ecosystem”
Venue: 47th International Conferenceon Software Engineering (ICSE), 2025
May 2025

Skills

Programming
  • Terminal/Bash Scripting
  • Databases (SQL, MongoDB, etc.)
  • Python
  • R
  • C
  • Data Visualization (Power BI, Tableau, etc.)
  • Project Management Tools
Languages
  • Persian (Native)
  • English (Fluent - C1)
  • German (Working Knowledge - B1)
Competencies
  • Advanced analytical and problem-solving capabilities
  • Ability to tackle complex, intellectually demanding problems with minimal guidance
  • Rapid learning and adaptability in dynamic environments
  • Leadership in cross-functional teams, with strong communication and organizational skills

Certificates


Education

University of Tennessee, Knoxville, USA

Doctor of Philosophy, Computer Science
Dissertation: Copy-Based Reuse and its Implications in Open Source Software Supply Chains
May 2021 - May 2025

Sharif University of Technology, Tehran, Iran

Master of Science, Industrial Engineering - Industrial
Thesis: The Influence of Information Presentation and Risk Attitude on Asset Allocation in Financial Markets
September 2011 - September 2013

Mazandaran Institute of Technology, Babol, Iran

Bachelor of Science, Industrial Engineering - System Planning and Analysis
January 2007 - July 2011

National Organization for Development of Exceptional Talents, Babol, Iran

High School Diploma, Mathematics
September 1999 - September 2006