Skip to content

1st token latency has poor performance than other framework #509

@Light-Travlling

Description

@Light-Travlling

when i test the model "DeepSeek-R1-Distill-Qwen-7B", the TTFT metrix worse than openvino,I don't know if it's normal. If so, is there any way to improve this performance

Image

Environment:
CPU:2x8592+
Memory: 16x Hynix HMCG94AGBRA179N 64G DDR5 2Rx4

Benchmark command:

Image

Metadata

Metadata

Assignees

No one assigned

    Labels

    No labels
    No labels

    Type

    No type
    No fields configured for issues without a type.

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions