Porting Whisper model to n300 #13331

Open
4 of 8 tasks
saichandax opened this issue Oct 1, 2024 · 1 comment

saichandax commented Oct 1, 2024

Executive summary (as of Nov 12):

  • Single-device implementation is complete but BLOCKED on pending approvals and CI runs for #8339.

    • PCC = 0.98
    • BS = 8
    • Torch ops: Conv1D (#13529), Slice (#13672), Reshape (#13889)
    • Perf numbers
      • E2E perf is 0.97 samples/sec
      • The device perf run hangs on the latest main; debugging is in progress in draft PR #15257.
  • Data-parallel implementation is complete but BLOCKED on failing CI runs (#14334).

    • PCC = 0.85 (to be debugged)
    • BS = 16
    • Torch ops are the same as in the single-device implementation.
    • Unable to load the model's conv1d weights on device (#14952).
    • Device perf still needs to be checked and reported.
  • Trace + 2CQs implementation is blocked by the torch ops in the model.
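For readers comparing the PCC values above: PCC here presumably refers to the Pearson correlation coefficient between the torch reference output and the device output. The sketch below is only an illustration of that metric in plain Python, not the comparison helper used in the repo; the function name `pcc` and the sample values are hypothetical.

```python
import math
from typing import Sequence


def pcc(expected: Sequence[float], actual: Sequence[float]) -> float:
    """Pearson correlation coefficient between two flattened outputs.

    Hypothetical helper for illustration only; assumes both outputs
    have already been flattened to equal-length sequences of floats.
    """
    if len(expected) != len(actual) or len(expected) == 0:
        raise ValueError("inputs must be non-empty and of equal length")

    n = len(expected)
    mean_e = sum(expected) / n
    mean_a = sum(actual) / n

    # Covariance and variances (unnormalized; the 1/n factors cancel).
    cov = sum((e - mean_e) * (a - mean_a) for e, a in zip(expected, actual))
    var_e = sum((e - mean_e) ** 2 for e in expected)
    var_a = sum((a - mean_a) ** 2 for a in actual)

    denom = math.sqrt(var_e * var_a)
    if denom == 0.0:
        # Correlation is undefined when either input is constant.
        return float("nan")
    return cov / denom


# Made-up values standing in for a torch reference and a device output.
reference = [0.0, 1.0, 2.0, 3.0]
device = [0.1, 0.9, 2.2, 2.9]
print(f"PCC = {pcc(reference, device):.4f}")
```

A value close to 1.0 means the device output tracks the reference closely, so a drop from 0.98 to 0.85 suggests a numerical divergence somewhere in the data-parallel path that is worth isolating layer by layer.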

ToDo:

  • Debug the device perf hang on single device (n300): #14800
  • Data parallel:
    • Debug the PCC drop (create an issue).
    • Fix loading of the model's conv1d weights on device: #14952
    • Check device perf and report numbers.
  • Implement perf with trace + 2CQs (currently blocked by torch ops).
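As a rough aid for reporting the perf numbers mentioned above, the sketch below shows one way an end-to-end samples/sec figure could be measured and related to batch size. It assumes "samples/sec" counts individual inputs rather than batches; `samples_per_second`, `seconds_per_batch`, and the stand-in `fake_batch` are hypothetical names, not part of the model's test harness.

```python
import time
from typing import Callable


def samples_per_second(
    run_batch: Callable[[], None],
    batch_size: int,
    iterations: int = 10,
    warmup: int = 1,
) -> float:
    """Time `iterations` calls of `run_batch` and return samples/sec."""
    # Warm-up runs are excluded so one-time setup cost does not skew timing.
    for _ in range(warmup):
        run_batch()

    start = time.perf_counter()
    for _ in range(iterations):
        run_batch()
    elapsed = time.perf_counter() - start

    if elapsed <= 0.0:
        raise RuntimeError("elapsed time too small to measure")
    return batch_size * iterations / elapsed


def seconds_per_batch(throughput: float, batch_size: int) -> float:
    """Convert a samples/sec figure into the time taken by one batch."""
    return batch_size / throughput


def fake_batch() -> None:
    # Stand-in for one end-to-end inference call on a full batch.
    time.sleep(0.001)


print(f"{samples_per_second(fake_batch, batch_size=8, iterations=5):.1f} samples/sec")
print(f"{seconds_per_batch(0.97, 8):.2f} s per batch")
```

Under that assumption, the reported 0.97 samples/sec at BS = 8 corresponds to roughly 8.2 seconds per batch.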

kkeerthana0573 (Contributor) commented:

The Whisper model is ported to n300 with the following torch ops:

  1. Slice op - #13672
  2. Add op - #13464
  3. Conv1d op - #13529

Model PCC is 0.989.
Demo results:

  • Accuracy for WhisperForAudioClassification: 0.75
  • Accuracy for WhisperForConditionalGeneration: 0.73

Single-card demo pipeline: #8339
