[Llama3-text vLLM integration] Modify Llama3 text model (new and old codebase) forward apis for vLLM compatibility #35424
| Job | Run time |
|---|---|
| | 20s |
| | 53s |
| | 1s |
| | 5s |
| | 20s |
| | 6s |
| | 8s |
| | 3m 2s |
| | 54s |
| | 5m 49s |