Presentation: How do you test an AI application? on 27 May 2025 in Heidelberg

Software with AI can assist people with tasks that previously required a great deal of effort. But how can you automatically test that the AI application really does what it is supposed to do?

Format

Event

STEM discipline(s)

Computer science

Target group(s)

Young adults
Students (general)
Student teachers
Upper secondary school teachers
University lecturers
Researchers
Coordinators and multipliers of STEM education programs
Parents of school students

Specific target group(s)

Girls and/or women
Other

Number of participants

Please register via the form on our website.

Areas of activity

Heidelberg / Baden-Württemberg

Project description

Idea

The following questions will be addressed: How do you build an evaluation dataset (self-created vs. synthetic data)? How do you assess whether a returned answer matches the expected answer (regular expressions vs. LLM-as-a-judge)? What metrics can be collected (e.g., for retrieval and generation)? What platforms are available (e.g., Langfuse, LangSmith, or a custom solution)? How can user feedback be collected and used to improve the AI application?

Further information

About our speaker: Dr. Anja Kleebaum received her PhD from the Chair of Software Engineering at Heidelberg University, where she worked as a research assistant in software engineering education. In her dissertation, she designed methods and tools for lightweight decision management (rationale management) during agile software development. Since 2023, she has been working as an agile software engineer at andrena objects and on various AI projects. To help us plan the buffet before and after the presentation, please fill out this form.
