Showing archive results for 2025

Sep 25, 2025

Taming Complexity: Intuitive Evaluation Framework for Agentic Chatbots in Business-Critical Environments

Karol Żak, Marc Gomez

This blog post introduces a comprehensive evaluation framework for enterprise chatbots powered by large language models (LLMs), specifically addressing the challenges of assessing Line of Business (LOB) agents in business-critical environments. The authors tackle the fundamental problem that traditional chatbot evaluation metrics fail to capture th...

CSE, Machine Learning, Frameworks

Aug 7, 2025

Learnings from External Data Handling

Ashley Costigane

This blog post discusses the challenges and solutions encountered by the ISE team at Microsoft while making a distributed system production-ready. It focuses on issues including slow processing speeds and out-of-memory exceptions, and provides insights into the methods used to address these problems.

CSE, ISE

Jul 18, 2025

AI Model Promotion with dstoolkit-mlops-v2

Malcolm Miller, Daniel Ferguson

This post evaluates various repository structures and designs for maximizing the efficiency of data scientists and software engineers who develop, promote, and deploy AI models on the same project.

CSE, ISE