[MLOB-2691] Reorg LLMO evaluations documentation #29421
Merged: dd-mergequeue merged 24 commits into master from gergely.svigruha/reorg-eval-documentation on May 23, 2025
Commits
aee6be0 Move eval subsections (gsvigruha)
5c72813 Add ootb evals, menu (gsvigruha)
c158d16 Move config (gsvigruha)
3c4db89 fix links (gsvigruha)
70b7e67 submenu (gsvigruha)
fce8efe move specific evals (gsvigruha)
364aaa7 nit (gsvigruha)
1d1d1b9 Merge branch 'master' of github.com:DataDog/documentation into gergel… (gsvigruha)
58090ae nit (gsvigruha)
a9d9f8d menu (gsvigruha)
55bd2a7 replace example (gsvigruha)
8ea0954 links (gsvigruha)
7938d05 reviewr comments (gsvigruha)
6131cb5 nit (gsvigruha)
b989016 Add hallucination (gsvigruha)
9ace600 add image (gsvigruha)
092c602 instrumentation (gsvigruha)
cfb04df nit (gsvigruha)
00bcdb5 Merge branch 'master' of github.com:DataDog/documentation into gergel… (gsvigruha)
7878a21 addressreviewer comments (gsvigruha)
db12144 second round (gsvigruha)
3bf440d remove further reading (gsvigruha)
ca98664 nit (gsvigruha)
44db6fc Add aliases for moved pages (janine-c)
@@ -0,0 +1,40 @@
---
title: Evaluations
description: Learn how to configure Evaluations for your LLM application.
aliases:
- /tracing/llm_observability/evaluations/
- /llm_observability/configuration/
---

{{< site-region region="gov" >}}
<div class="alert alert-warning">LLM Observability is not available in the selected site ({{< region-param key="dd_site_name" >}}) at this time.</div>
{{< /site-region >}}

## Overview

LLM Observability offers several ways to support evaluations:

### Out of the Box Evaluations

Datadog builds and supports [Out of the Box Evaluations][1] to support common use cases. You can enable and configure them within the LLM Observability application.

### Submit Evaluations

You can also [Submit Evaluations][2] using Datadog's API. This mechanism is useful if you have your own evaluation system but want to centralize that information within Datadog.

### Evaluation Integrations

Datadog also supports integrations with some third-party evaluation frameworks, such as [Ragas][3] and [NeMo][4].

### Sensitive Data Scanner integration

In addition to evaluating the input and output of LLM requests, agents, workflows, or the application, LLM Observability integrates with [Sensitive Data Scanner][5], which helps prevent data leakage by identifying and redacting any sensitive information (such as personal data, financial details, or proprietary information) that may be present in any step of your LLM application.

By proactively scanning for sensitive data, LLM Observability helps keep conversations secure and compliant with data protection regulations. This additional layer of security reinforces Datadog's commitment to maintaining the confidentiality and integrity of user interactions with LLMs.


[1]: /llm_observability/evaluations/ootb_evaluations
[2]: /llm_observability/evaluations/submit_evaluations
[3]: /llm_observability/evaluations/ragas_evaluations
[4]: /llm_observability/evaluations/submit_nemo_evaluations
[5]: /security/sensitive_data_scanner/
^ This will redirect this page from the old URL so we don't have a dead link 🙂
This is frustrating; I can't add suggestions for other renamed/moved files because GitHub won't let me add comments for unchanged lines. But generally, anytime we rename or move a file, it's important to add in aliases so users clicking old links or using old bookmarks won't get a 404. Happy to help with this if you need it - just let me know!
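The alias rule described in the comment above could be checked mechanically. The following is a minimal sketch, not part of this PR or of the documentation repo's tooling: `extract_aliases` and `missing_aliases` are hypothetical helper names, the parser only handles the simple block-list form of `aliases:` shown in the diff (not inline YAML lists), and `/llm_observability/old_page/` is a made-up path used for illustration.

```python
def extract_aliases(markdown_text: str) -> list[str]:
    """Return the alias paths declared in a page's YAML front matter.

    Assumes front matter delimited by '---' lines and an 'aliases:' key
    followed by '- path' items, as in the diff above.
    """
    lines = markdown_text.splitlines()
    # Front matter must open with '---' on the first line.
    if not lines or lines[0].strip() != "---":
        return []
    aliases = []
    in_aliases = False
    for line in lines[1:]:
        stripped = line.strip()
        if stripped == "---":  # closing delimiter: stop scanning
            break
        if stripped == "aliases:":
            in_aliases = True
            continue
        if in_aliases:
            if stripped.startswith("- "):
                aliases.append(stripped[2:].strip())
            elif stripped:
                # Any other non-empty line ends the aliases list.
                in_aliases = False
    return aliases


def missing_aliases(markdown_text: str, old_paths: list[str]) -> list[str]:
    """Return the old URLs that the page does not redirect from."""
    declared = set(extract_aliases(markdown_text))
    return [path for path in old_paths if path not in declared]


# Front matter copied from the diff in this PR.
page = """---
title: Evaluations
description: Learn how to configure Evaluations for your LLM application.
aliases:
- /tracing/llm_observability/evaluations/
- /llm_observability/configuration/
---

## Overview
"""

# The second path is hypothetical and is reported as missing.
print(missing_aliases(page, [
    "/llm_observability/configuration/",
    "/llm_observability/old_page/",
]))
```

A check like this would need the list of old paths for each moved file, for example derived from the rename information in the diff; how that list is obtained is left open here.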