Commit
Update SDXL Egress stat
nvyihengz committed Feb 8, 2024
1 parent 630ef8f commit b04253d
Showing 1 changed file with 2 additions and 2 deletions.
4 changes: 2 additions & 2 deletions inference_rules.adoc
@@ -1037,7 +1037,7 @@ Datacenter systems must provide at least the following bandwidths from the netwo
|===
=== Egress Bandwidth

-Datacenter systems must provide at least the following bandwidths from the output location (e.g. DRAM) to the network or I/O device. The minimum bandwidth is a function of the throughput achieved by the SUT and the output data types. For all models except 3D Unet, the output sizes are negligible. Therefore, for those models, the egress bandwidth must simply be greater than 0.
+Datacenter systems must provide at least the following bandwidths from the output location (e.g. DRAM) to the network or I/O device. The minimum bandwidth is a function of the throughput achieved by the SUT and the output data types. For all models except 3D Unet and SDXL, the output sizes are negligible. Therefore, for those models, the egress bandwidth must simply be greater than 0.

|===
|Area |Model |Dataset | Symbolic input size formula | Numeric input size formula | Minimum network bandwidth (bytes/sec)
@@ -1048,5 +1048,5 @@ Datacenter systems must provide at least the following bandwidths from the outpu
|Language |BERT |SQuAD v1.1 (max_seq_len=384) | negligible | negligible | __> 0__
|Language |GPT-J |CNN Dailymail (v3.0.0, max_seq_len=2048) | negligible | negligible | __> 0__
|Commerce |DLRMv2 |Synthetic Multihot Criteo Dataset | negligible | negligible | __> 0__
-|Generative |SDXL |Subset of coco-2014 val captions (max_prompt_len=77) | negligible | negligible | __> 0__
+|Generative |SDXL |Subset of coco-2014 val captions (max_prompt_len=77) | __3,145,728*dtype_size__ | __throughput*3,145,728*dtype_size__ | __> 0__
|===
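
The new SDXL row gives its numeric size formula as `throughput*3,145,728*dtype_size`. A minimal sketch of that arithmetic follows. The function name, the throughput unit (samples per second), and the reading of 3,145,728 as 1024 * 1024 * 3 output elements per image are assumptions made for illustration, not something the commit states.

```python
# Element count taken from the SDXL row of the table.
# Assumption: it corresponds to one 1024 x 1024 image with 3 channels.
SDXL_OUTPUT_ELEMENTS = 3_145_728


def sdxl_egress_bytes_per_sec(throughput: float, dtype_size: int) -> float:
    """Evaluate throughput * 3,145,728 * dtype_size for the SDXL row.

    throughput: samples per second achieved by the SUT (assumed unit).
    dtype_size: size in bytes of one output element (e.g. 1 for uint8).
    """
    if throughput < 0:
        raise ValueError("throughput must be non-negative")
    if dtype_size <= 0:
        raise ValueError("dtype_size must be a positive number of bytes")
    return throughput * SDXL_OUTPUT_ELEMENTS * dtype_size


if __name__ == "__main__":
    # Hypothetical figures: 10 samples/s with 1-byte output elements.
    print(sdxl_egress_bytes_per_sec(10.0, 1))
```

With these hypothetical inputs the formula gives 31,457,280 bytes/sec, which is no longer negligible in the way the other rows are.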
