Towards Agentic AI for Science: Hypothesis Generation, Comprehension, Quantification, and Validation (AAAI 2025 Spring Symposium Series)

March 31-April 02, 2025 | San Francisco Airport Marriott | Waterfront | Burlingame, CA, USA

About the Symposium

Our mission is to foster interdisciplinary collaboration to develop fully autonomous AI systems, addressing challenges like benchmark datasets, human-AI collaboration, robust tools and methods for validating AI outputs, and trustworthiness. By tackling these issues, we can unlock AI's transformative potential in research. In this symposium, themed Agentic AI for Science, we will explore these critical topics and welcome diverse perspectives. We will focus on integrating agentic AI systems to enhance scientific discovery while upholding rigorous standards. For AI to contribute effectively, it must generate novel hypotheses, comprehend their applications, quantify testing resources, and validate feasibility through well-designed experiments. This symposium serves as a vital forum for collaboration and knowledge-sharing aimed at redefining the landscape of scientific discovery. This symposium aims to address four main research thrusts to propel future research, including (non-exclusively):

Thrust 1. Design and development of agentic AI systems for scientific discovery. The emergence of agentic AI, powered by foundation models—particularly generative models—opens up unprecedented opportunities for scientific discovery. These systems can potentially revolutionize various aspects of the scientific process, including hypothesis generation, comprehension of complex scientific phenomena, quantification, and validation. Designing and developing effective agentic AI systems for scientific discovery is both exciting and non-trivial. Pioneering work in this field has already demonstrated the promise of leveraging scientific tools, agents, and knowledge graphs. Notable examples include ChemCrow, which showcases the potential of AI in chemistry; Crispr-GPT, which applies AI to genetic engineering; and SciAgents , which illustrates the power of multi-agent systems in scientific discovery. These groundbreaking studies highlight the transformative potential of agentic AI in accelerating scientific progress and opening new avenues for research. Key research topics in this thrust include (but not limited to):

  • Developing scientific foundation models: Tailoring general foundation models specifically for various scientific fields to enhance relevance and accuracy.
  • Effective scientific tool augmentation: Enhancing existing scientific tools and methodologies with agentic AI capabilities.
  • Multi-agent decomposition design: Developing frameworks for scientific hypothesis generation using multiple specialized AI agents.
  • Human-in-the-loop agentic systems: Improving reliability and interpretability of AI-driven scientific discoveries through strategic human intervention.

Thrust 2. Theoretical foundation for scientific agentic AI. Developing agentic scientific AI requires methods to quantify the predictions and performance of these systems, as well as to validate the scientific hypotheses they generate. A thorough investigation of agentic scientific AI systems also demands solid theoretical foundations and tools to ensure guarantees on their behavior. To analyze and evaluate such systems, we will incorporate theoretical tools in modeling, logical reasoning, model validation and diagnosis, interpretable AI, and other general methods that can provide guarantees on agentic systems. Key topics in this area include, but are not limited to, the following:

  • Theoretical foundation: Statistical models and theories of agentic scientific AI, such as theoretical studies on in-context learning, multi-agent communications, game theory, physics-informed hard and soft optimization constraints, and neural operators.
  • Logic reasoning: Inductive, deductive, and abductive reasoning; Bayesian reasoning and probabilistic programming; neural-symbolic approaches.
  • Model quantification, validation, diagnosis: Theory-driven metrics for quantifying AI system performance; self-evaluation of LLMs; data valuation and data-centric AI; diagnostics for data, architecture, and training processes; creation of standardized benchmarks for evaluating the validity of scientific hypothesis generation; scientific facts and hallucination.
  • Interpretable AI: Approaches for explaining agentic AI system behaviors; quantifying trust, safety, and transparency; mechanistic interpretability.

Thrust 3. Practical application of scientific agentic AI. Deploying agentic AI systems in practical scientific research across diverse domains presents numerous challenges, particularly due to the need for domain-specific adaptation such as the unique data formats and model constraints of each scientific field. Bias in training data poses a significant risk, especially in sensitive domains like medicine. Trustworthiness and explainability are essential for scientists to confidently integrate AI-generated hypotheses and solutions into their research. Furthermore, ethical considerations arise when AI systems potentially automate research decisions that may impact public health, policy, or environmental outcomes, underscoring the importance of responsible AI deployment in science.

  • Domain-specific model adaptation: Adapting agentic AI models to handle domain-specific data formats, workflows, and tools across various scientific fields; transfer learning and data-efficient fine-tuning.
  • Bias detection and mitigation: Identifying and mitigating bias in training data, model design and outputs; fairness-aware AI systems for sensitive domains like healthcare and social science.
  • Robustness, trustworthiness and explainability: Methods for improving the transparency and explainability of agentic AI systems in scientific research; uncertainty interpretation and quantification.
  • Ethical considerations and responsible use of agentic AI in sensitive research areas; development of AI governance models to ensure accountability and human oversight in automated scientific workflows.

Thrust 4. Open problems and challenges on scientific agentic AI. Despite the promising potential of agentic AI in scientific discovery, many open problems and challenges remain to be addressed. These may include:

  • Automatic curation of domain-specific scientific domains and integration of the knowledge into agentic AI systems.
  • Advanced mechanisms of multi-agent collaboration in scientific discovery, with considerations of their scalability and computational efficiency.
  • Continual evolution and learning of agentic AI systems; Mechanisms for updating models and improving performance based on experimental results, new data and discoveries.
  • Validation and reproducibility of results generated by agentic AI systems.

Keynote Speakers

We are honored to have invited five distinguished keynote speakers.

Dr. Michael W. Mahoney

Dr. Michael W. Mahoney (Tentative)

Department of Statistics, University of California, Berkeley

Dr. Sanmi Koyejo

Dr. Sanmi Koyejo (Tentative)

Department of Computer Science, Stanford University

Dr. Su-In Lee

Dr. Su-In Lee (Tentative)

Department of Computer Science, University of Washington

Dr. Marinka Zitnik

Dr. Marinka Zitnik (Tentative)

Department of Biomedical Informatics, Harvard Medical School

Dr. Markus J. Buehler

Dr. Markus J. Buehler

Department of Civil and Environmental Engineering, Massachusetts Institute of Technology

Call for Papers

We are pleased to announce the Symposium on Agentic AI for Science: Hypothesis Generation, Comprehension, Quantification, and Validation as part of AAAI Spring Symposium Series, 2025, to be held at San Francisco Airport Marriott Waterfront, Burlingame, CA, USA on March 31-April 02, 2025. This symposium aims to explore the transformative potential of agentic AI in scientific discovery, focusing on hypothesis generation, validation, and other critical stages of the scientific process. By fostering interdisciplinary collaboration, the symposium seeks to address challenges and unlock new opportunities in the design and application of agentic AI systems.

Important Deadlines

All deadlines are 11:59pm Anywhere on Earth (AoE):

  • Paper submission deadline: February 1, 2025
  • Acceptance notification: February 10, 2025
  • Camera-ready deadline: February 17, 2025
  • Symposium dates: March 31–April 2, 2025

Submission Requirements

We welcome submissions in the form of both full papers and position or short papers. Submissions may be archival or non-archival. Accepted archival papers will be published as part of the “Proceedings of the AAAI Symposium Series” by the AAAI Library.

  • Maximum length: 7 pages of main content, with unlimited pages for references.
  • Anonymity: Submissions must be anonymized for double-blind review. Please do not include acknowledgments in the initial submission.
  • Formatting: Use the AAAI-25 Author Kit for formatting your submission.

Submission Site

Submissions will be managed via the AAAI Official EasyChair site: EasyChair Submission Portal.

Reviewing Process

Each submission will undergo a rigorous double-blind peer-review process. Submissions will be evaluated based on technical merit, originality, relevance, and adherence to ethical principles.

Publication and Presentation Policies

Accepted papers will be published in the "Proceedings of the AAAI Symposium Series" and presented during the symposium. Authors are required to present their papers in-person during the symposium.

Program Committee Co-Chairs

Symposium Schedule

The symposium is scheduled to take place during the AAAI Spring Symposium Series from March 31 to April 2, 2025.

Date Time Schedule
03/31 09:00 - 09:30 AM Opening Remarks
09:30 - 10:20 AM Keynote #1
10:20 - 10:30 AM Coffee Break
10:30 AM - 12:30 PM Technical Presentation #1
12:30 - 01:30 PM Lunch Break
01:30 - 03:30 PM Accepted Talks #1
03:30 - 03:40 PM Coffee Break
03:40 - 04:30 PM Keynote #2
04:30 - 05:00 PM Panel Discussion #1
04/01 09:00 - 09:10 AM Recap of Day 01
09:10 - 10:00 AM Keynote #3
10:00 - 11:00 AM Technical Presentation #2
11:00 - 11:10 AM Coffee Break
11:10 AM - 12:30 PM Breakout Sessions
12:30 - 01:30 PM Lunch Break
01:30 - 02:20 PM Keynote #4
02:20 - 02:30 PM Coffee Break
02:30 - 03:30 PM Accepted Talks #2
03:30 - 04:30 PM Panel Discussion #2
04:30 - 05:00 PM Brainstorming Session #1
04/02 09:00 - 09:10 AM Recap of Day 02
09:10 - 10:00 AM Keynote #5
10:00 - 11:20 AM Accepted Talks #3
11:20 - 11:30 AM Coffee Break
11:30 AM - 12:30 PM Workshop
12:30 - 01:00 PM Closing Remarks
<

ORGANIZERS

Dr. Lifu Huang

Dr. Lifu Huang

UC Davis

Dr. Danai Koutra

Dr. Danai Koutra

University of Michigan

Dr. Temiloluwa Prioleau

Dr. Temiloluwa Prioleau

Dartmouth College

Dr. Qingyun Wu

Dr. Qingyun Wu

Pennsylvania State University

Dr. Yujun Yan

Dr. Yujun Yan

Dartmouth College

Mingyu Derek Ma

Mingyu Derek Ma

UC Los Angeles, Genentech Prescient Design

Dr. Wei Wang

Dr. Wei Wang

UC Los Angeles

Kexin Huang

Kexin Huang

Stanford University

Dr. Jure Leskovec

Dr. Jure Leskovec

Stanford University

Dr. Hanchen Wang

Dr. Hanchen Wang

Stanford University

Dr. James Zou

Dr. James Zou

Stanford University

Dr. Dawei Zhou

Dr. Dawei Zhou

Virginia Tech

Publicity Chair

Dr. Adithya Kulkarni

Dr. Adithya Kulkarni

Virginia Tech

CONTACT US

For inquiries regarding the symposium, please reach out to us at aaaiagenticai@googlegroups.com.