Adjust Qwen2-7B test case #1551

Wei-Lin-Intel · 2024-12-04T03:26:09Z

What does this PR do?

Since the accuracy patch for Qwen2 family (PR Link) was merged, the test case should be also adjusted for the batch size and output.

GAUDI2_CI=1  RUN_SLOW=1 python3.10 -m pytest tests/test_text_generation_example.py -s -v -k generation_bf16_1x[token0-Qwen/Qwen2-7B-256

Result:

Stats:
----------------------------------------------------------------------------------
Input tokens
Throughput (including tokenization) = 8871.747465242199 tokens/second
Memory allocated                    = 62.52 GB
Max memory allocated                = 65.15 GB
Total memory available              = 94.62 GB
Graph compilation duration          = 10.456412034000095 seconds
----------------------------------------------------------------------------------

PASSED

========================================================================================================== warnings summary ===========================================================================================================
tests/test_text_generation_example.py::test_text_generation_bf16_1x[token0-Qwen/Qwen2-7B-256-False-8870.945160540245-True]
  /usr/lib/python3.10/inspect.py:288: FutureWarning: `torch.distributed.reduce_op` is deprecated, please use `torch.distributed.ReduceOp` instead
    return isinstance(object, types.FunctionType)

-- Docs: https://docs.pytest.org/en/stable/how-to/capture-warnings.html
============================================================================================ 1 passed, 65 deselected, 1 warning in 45.94s =============================================================================================

Before submitting

This PR fixes a typo or improves the docs (you can dismiss the other checks if that's the case).
Did you make sure to update the documentation with your changes?
Did you write any new necessary tests?

Wei-Lin-Intel · 2024-12-04T03:27:17Z

@jiminha Please help to review it. Thanks.

HuggingFaceDocBuilderDev · 2024-12-04T08:48:32Z

The docs for this PR live here. All of your documentation changes will be reflected on that endpoint. The docs are available until 30 days after the last update.

Adjust Qwen2-7B test case

b5f8c06

Wei-Lin-Intel requested a review from regisss as a code owner December 4, 2024 03:26

regisss approved these changes Dec 4, 2024

View reviewed changes

regisss merged commit c89b231 into huggingface:main Dec 4, 2024
4 checks passed

regisss pushed a commit that referenced this pull request Dec 5, 2024

Adjust Qwen2-7B test case (#1551)

d653394

imangohari1 pushed a commit to imangohari1/optimum-habana that referenced this pull request Dec 10, 2024

Adjust Qwen2-7B test case (huggingface#1551)

a6c5886

Liangyx2 pushed a commit to HabanaAI/optimum-habana-fork that referenced this pull request Jan 20, 2025

Adjust Qwen2-7B test case (huggingface#1551)

f559d66

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Adjust Qwen2-7B test case #1551

Adjust Qwen2-7B test case #1551

Wei-Lin-Intel commented Dec 4, 2024

Wei-Lin-Intel commented Dec 4, 2024

HuggingFaceDocBuilderDev commented Dec 4, 2024

Adjust Qwen2-7B test case #1551

Adjust Qwen2-7B test case #1551

Conversation

Wei-Lin-Intel commented Dec 4, 2024

What does this PR do?

Before submitting

Wei-Lin-Intel commented Dec 4, 2024

HuggingFaceDocBuilderDev commented Dec 4, 2024