[OPIK-1834] [P SDK] ConversationalCoherence conversation metric #2579

yaricom · 2025-06-25T16:01:30Z

Details

Conversational Coherence Score: measures whether the conversation session felt like a natural, adaptive, helpful interaction.

The metric is calculated as follows:

$$coherenceScore = \frac{relevantTurns}{totalTurns}$$

The ConversationalCoherenceMetric first constructs a sliding window of turns for each turn, then uses an LLM to determine whether the "assistant" content of the last turn in each window is relevant to the preceding conversational context in that window.
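The windowing and scoring described above can be sketched as follows. This is a minimal illustration, not the SDK implementation: the turn shape (`{"user": ..., "assistant": ...}`), the function names `sliding_windows`, `coherence_score`, and `toy_judge`, and the window size of 2 are all assumptions made for the example, and the word-overlap judge is a deterministic stand-in for the LLM relevance verdict.

```python
import re
from typing import Callable, Dict, List

# Hypothetical turn shape for this sketch: one user message plus one assistant reply.
Turn = Dict[str, str]


def sliding_windows(turns: List[Turn], window_size: int) -> List[List[Turn]]:
    """For each turn i, return the window of up to `window_size` turns ending at i."""
    return [turns[max(0, i - window_size + 1): i + 1] for i in range(len(turns))]


def coherence_score(
    turns: List[Turn],
    window_size: int,
    is_relevant: Callable[[List[Turn]], bool],
) -> float:
    """Compute relevantTurns / totalTurns, judging the last turn of every window."""
    if not turns:
        raise ValueError("conversation has no turns")
    windows = sliding_windows(turns, window_size)
    relevant_turns = sum(1 for window in windows if is_relevant(window))
    return relevant_turns / len(windows)


def toy_judge(window: List[Turn]) -> bool:
    """Stand-in for the LLM call (assumption): the last assistant reply counts as
    relevant if it shares at least one word with any user message in the window."""
    def words(text: str) -> set:
        return set(re.findall(r"[a-z]+", text.lower()))

    context = set()
    for turn in window:
        context |= words(turn["user"])
    return bool(context & words(window[-1]["assistant"]))


turns = [
    {"user": "What is the weather in Paris?", "assistant": "The weather in Paris is sunny."},
    {"user": "And tomorrow?", "assistant": "I like turtles."},
    {"user": "Should I bring an umbrella?", "assistant": "No umbrella needed, it stays dry."},
    {"user": "Thanks!", "assistant": "Bananas are yellow."},
]

windows = sliding_windows(turns, window_size=2)
score = coherence_score(turns, window_size=2, is_relevant=toy_judge)
print(score)  # 2 relevant turns out of 4 -> 0.5
```

In this sketch the first window contains only one turn, since there is no earlier context to include, and every turn contributes exactly one window, so `totalTurns` equals the number of windows.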

Testing

Added related unit tests

Documentation

Added related docstrings

…entation.
- Introduced `ConversationalCoherenceMetric` for evaluating coherence of conversation exchanges within sliding windows.
- Implemented templates and schema for constructing evaluation queries and parsing responses.
- Added helper methods to generate sliding windows of conversation turns.
- Included Pydantic models for validating and handling evaluation responses.
- Integrated scoring calculation and irrelevancy extraction logic for detailed analysis.

github-actions · 2025-06-25T16:03:32Z

SDK Unit Tests Results

636 tests 635 ✅ 27s ⏱️
1 suites 0 💤
1 files 0 ❌ 1 🔥

For more details on these errors, see this check.

Results for commit bf0da27.

♻️ This comment has been updated with latest results.

…ss metrics.
- Added `ConversationTurn` class for structured representation of conversation exchanges.
- Implemented `build_conversation_turns` and `merge_turns` utilities for processing conversational data.
- Refactored window-based scoring logic to use conversation turns, improving clarity and modularity.
- Introduced comprehensive tests for conversational coherence and session completeness metrics, covering error scenarios and ensuring robustness.
- Improved error handling and logging across metric computation functions.

yaricom added 2 commits June 25, 2025 18:57

[OPIK-1834]: Fixed imports

98d9d90
