[Feature] Response Metrics · Issue #2673 · InternLM/lmdeploy · GitHub

[Feature] Response Metrics #2673
@nathan-az

Motivation

Response metrics are very useful for benchmarking the performance of different configurations. LMDeploy could implement metrics similar to vLLM's RequestMetrics.

From skimming the source code, I think adding basic metrics such as first token time and finish time should be fairly straightforward for AsyncEngine. I'm not sure whether changes would be required in other areas.
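As a rough illustration of the idea, here is a minimal sketch of per-request timing around a streaming generator. Everything in it is an assumption for illustration: `ResponseMetrics`, `timed_stream`, and `fake_generate` are hypothetical names, not LMDeploy or vLLM APIs, and the fake generator only stands in for whatever async stream AsyncEngine actually produces.

```python
import asyncio
import time
from dataclasses import dataclass
from typing import AsyncIterator, List, Optional, Tuple


@dataclass
class ResponseMetrics:
    """Hypothetical metrics container; field names are illustrative only."""
    arrival_time: float
    first_token_time: Optional[float] = None
    finished_time: Optional[float] = None

    @property
    def time_to_first_token(self) -> Optional[float]:
        # None until the first chunk has been observed.
        if self.first_token_time is None:
            return None
        return self.first_token_time - self.arrival_time

    @property
    def total_time(self) -> Optional[float]:
        # None until the stream has been fully consumed.
        if self.finished_time is None:
            return None
        return self.finished_time - self.arrival_time


async def timed_stream(
    stream: AsyncIterator[str], metrics: ResponseMetrics
) -> AsyncIterator[str]:
    """Pass chunks through unchanged while recording timestamps."""
    async for chunk in stream:
        if metrics.first_token_time is None:
            metrics.first_token_time = time.monotonic()
        yield chunk
    metrics.finished_time = time.monotonic()


async def fake_generate(n: int) -> AsyncIterator[str]:
    """Stand-in for an engine's streaming output (assumption, not a real API)."""
    for i in range(n):
        await asyncio.sleep(0)
        yield f"tok{i}"


async def collect(n: int = 3) -> Tuple[List[str], ResponseMetrics]:
    # Record arrival before the request is handed to the (fake) engine.
    metrics = ResponseMetrics(arrival_time=time.monotonic())
    chunks = [c async for c in timed_stream(fake_generate(n), metrics)]
    return chunks, metrics
```

Calling `asyncio.run(collect())` returns the streamed chunks together with the populated metrics object. Wrapping the stream rather than modifying the generator keeps the timing logic separate from generation; whether that fits the real AsyncEngine code path would need to be checked against the source.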

If there is interest, I am happy to make a PR.
