update libMLIR_h bindings and fix bindings update step n°2 (#202)
* update libMLIR_h.jl
* fetch lazy artifacts in pre-process step
071df34
Reactant.jl Benchmarks
| Benchmark suite | Current | Previous | Ratio |
|-|-|-|-|
| ViT base (256 x 256 x 3 x 32)/forward/CUDA/Reactant (optimize = :after_enzyme) | 1293777162 ns | 1350735972 ns | 0.96 |
| ViT base (256 x 256 x 3 x 32)/forward/CUDA/Reactant | 1310912331 ns | 1204673780 ns | 1.09 |
| ViT base (256 x 256 x 3 x 32)/forward/CUDA/Reactant (optimize = :before_enzyme) | 1289076756 ns | 1222340041 ns | 1.05 |
| ViT base (256 x 256 x 3 x 32)/forward/CUDA/Reactant (optimize = :only_enzyme) | 2516467034 ns | 2330048189 ns | 1.08 |
| ViT base (256 x 256 x 3 x 32)/forward/CUDA/Lux | 218752184 ns | 219309808 ns | 1.00 |
| ViT base (256 x 256 x 3 x 32)/forward/CPU/Reactant (optimize = :after_enzyme) | 5129809896 ns | 5215948763 ns | 0.98 |
| ViT base (256 x 256 x 3 x 32)/forward/CPU/Reactant | 5430197585 ns | 6254101434 ns | 0.87 |
| ViT base (256 x 256 x 3 x 32)/forward/CPU/Reactant (optimize = :before_enzyme) | 4979540725 ns | 5201734622 ns | 0.96 |
| ViT base (256 x 256 x 3 x 32)/forward/CPU/Reactant (optimize = :only_enzyme) | 6919614889 ns | 7460701472 ns | 0.93 |
| ViT base (256 x 256 x 3 x 32)/forward/CPU/Lux | 38413778097 ns | 30735680811 ns | 1.25 |
| ViT small (256 x 256 x 3 x 4)/forward/CUDA/Reactant (optimize = :after_enzyme) | 1278646924 ns | 1332885325 ns | 0.96 |
| ViT small (256 x 256 x 3 x 4)/forward/CUDA/Reactant | 1277917926 ns | 1298553637 ns | 0.98 |
| ViT small (256 x 256 x 3 x 4)/forward/CUDA/Reactant (optimize = :before_enzyme) | 1257911932 ns | 1340798546.5 ns | 0.94 |
| ViT small (256 x 256 x 3 x 4)/forward/CUDA/Reactant (optimize = :only_enzyme) | 2531611781 ns | 2364390446 ns | 1.07 |
| ViT small (256 x 256 x 3 x 4)/forward/CUDA/Lux | 8801215 ns | 8812264.5 ns | 1.00 |
| ViT small (256 x 256 x 3 x 4)/forward/CPU/Reactant (optimize = :after_enzyme) | 1574880615 ns | 1635773422 ns | 0.96 |
| ViT small (256 x 256 x 3 x 4)/forward/CPU/Reactant | 1547093404 ns | 1621494476 ns | 0.95 |
| ViT small (256 x 256 x 3 x 4)/forward/CPU/Reactant (optimize = :before_enzyme) | 1545080349.5 ns | 1609772124 ns | 0.96 |
| ViT small (256 x 256 x 3 x 4)/forward/CPU/Reactant (optimize = :only_enzyme) | 2756285160 ns | 2830061683 ns | 0.97 |
| ViT small (256 x 256 x 3 x 4)/forward/CPU/Lux | 2982928651 ns | 2877521675 ns | 1.04 |
| ViT tiny (256 x 256 x 3 x 32)/forward/CUDA/Reactant (optimize = :after_enzyme) | 1228305920 ns | 1310197829 ns | 0.94 |
| ViT tiny (256 x 256 x 3 x 32)/forward/CUDA/Reactant | 1266393159.5 ns | 1304840979.5 ns | 0.97 |
| ViT tiny (256 x 256 x 3 x 32)/forward/CUDA/Reactant (optimize = :before_enzyme) | 1237209870.5 ns | 1238123602.5 ns | 1.00 |
| ViT tiny (256 x 256 x 3 x 32)/forward/CUDA/Reactant (optimize = :only_enzyme) | 2559979407 ns | 2594487217 ns | 0.99 |
| ViT tiny (256 x 256 x 3 x 32)/forward/CUDA/Lux | 22504460 ns | 22665498 ns | 0.99 |
| ViT tiny (256 x 256 x 3 x 32)/forward/CPU/Reactant (optimize = :after_enzyme) | 2130810338 ns | 2163350638 ns | 0.98 |
| ViT tiny (256 x 256 x 3 x 32)/forward/CPU/Reactant | 2154516099 ns | 2227062239 ns | 0.97 |
| ViT tiny (256 x 256 x 3 x 32)/forward/CPU/Reactant (optimize = :before_enzyme) | 2156781390 ns | 2171553343 ns | 0.99 |
| ViT tiny (256 x 256 x 3 x 32)/forward/CPU/Reactant (optimize = :only_enzyme) | 3372274619 ns | 3558585268 ns | 0.95 |
| ViT tiny (256 x 256 x 3 x 32)/forward/CPU/Lux | 6137706960 ns | 7747467151.5 ns | 0.79 |
| ViT tiny (256 x 256 x 3 x 4)/forward/CUDA/Reactant (optimize = :after_enzyme) | 1307625738.5 ns | 1283318913 ns | 1.02 |
| ViT tiny (256 x 256 x 3 x 4)/forward/CUDA/Reactant | 1350711438 ns | 1216591656.5 ns | 1.11 |
| ViT tiny (256 x 256 x 3 x 4)/forward/CUDA/Reactant (optimize = :before_enzyme) | 1313441438.5 ns | 1370999738 ns | 0.96 |
| ViT tiny (256 x 256 x 3 x 4)/forward/CUDA/Reactant (optimize = :only_enzyme) | 2565794827 ns | 2428504820 ns | 1.06 |
| ViT tiny (256 x 256 x 3 x 4)/forward/CUDA/Lux | 6571875 ns | 7186419 ns | 0.91 |
| ViT tiny (256 x 256 x 3 x 4)/forward/CPU/Reactant (optimize = :after_enzyme) | 1420391837 ns | 1467788340 ns | 0.97 |
| ViT tiny (256 x 256 x 3 x 4)/forward/CPU/Reactant | 1423519966 ns | 1449340906 ns | 0.98 |
| ViT tiny (256 x 256 x 3 x 4)/forward/CPU/Reactant (optimize = :before_enzyme) | 1414508660 ns | 1443567043 ns | 0.98 |
| ViT tiny (256 x 256 x 3 x 4)/forward/CPU/Reactant (optimize = :only_enzyme) | 2618724343 ns | 2702131547 ns | 0.97 |
| ViT tiny (256 x 256 x 3 x 4)/forward/CPU/Lux | 1347152220 ns | 1095865115 ns | 1.23 |
| ViT tiny (256 x 256 x 3 x 16)/forward/CUDA/Reactant (optimize = :after_enzyme) | 1242600424.5 ns | 1210758212 ns | 1.03 |
| ViT tiny (256 x 256 x 3 x 16)/forward/CUDA/Reactant | 1239539558.5 ns | 1233393897 ns | 1.00 |
| ViT tiny (256 x 256 x 3 x 16)/forward/CUDA/Reactant (optimize = :before_enzyme) | 1252776529 ns | 1225142106.5 ns | 1.02 |
| ViT tiny (256 x 256 x 3 x 16)/forward/CUDA/Reactant (optimize = :only_enzyme) | 2517090435 ns | 2374047383 ns | 1.06 |
| ViT tiny (256 x 256 x 3 x 16)/forward/CUDA/Lux | 12117214 ns | 12131784 ns | 1.00 |
| ViT tiny (256 x 256 x 3 x 16)/forward/CPU/Reactant (optimize = :after_enzyme) | 1692247967 ns | 1757223123 ns | 0.96 |
| ViT tiny (256 x 256 x 3 x 16)/forward/CPU/Reactant | 1691308826 ns | 1753678153 ns | 0.96 |
| ViT tiny (256 x 256 x 3 x 16)/forward/CPU/Reactant (optimize = :before_enzyme) | 1686383037 ns | 1747793752 ns | 0.96 |
| ViT tiny (256 x 256 x 3 x 16)/forward/CPU/Reactant (optimize = :only_enzyme) | 2886906039 ns | 3015378976 ns | 0.96 |
| ViT tiny (256 x 256 x 3 x 16)/forward/CPU/Lux | 3101540732 ns | 2910278832 ns | 1.07 |
| ViT small (256 x 256 x 3 x 16)/forward/CUDA/Reactant (optimize = :after_enzyme) | 1275391377 ns | 1322233467 ns | 0.96 |
| ViT small (256 x 256 x 3 x 16)/forward/CUDA/Reactant | 1280225766.5 ns | 1322254383 ns | 0.97 |
| ViT small (256 x 256 x 3 x 16)/forward/CUDA/Reactant (optimize = :before_enzyme) | 1212838454 ns | 1199964457 ns | 1.01 |
| ViT small (256 x 256 x 3 x 16)/forward/CUDA/Reactant (optimize = :only_enzyme) | 2378532850 ns | 2400347071 ns | 0.99 |
| ViT small (256 x 256 x 3 x 16)/forward/CUDA/Lux | 27266014 ns | 27163900 ns | 1.00 |
| ViT small (256 x 256 x 3 x 16)/forward/CPU/Reactant (optimize = :after_enzyme) | 2178479799 ns | 2214089958 ns | 0.98 |
| ViT small (256 x 256 x 3 x 16)/forward/CPU/Reactant | 2152860611 ns | 2239028395 ns | 0.96 |
| ViT small (256 x 256 x 3 x 16)/forward/CPU/Reactant (optimize = :before_enzyme) | 2150176632 ns | 2212157343 ns | 0.97 |
| ViT small (256 x 256 x 3 x 16)/forward/CPU/Reactant (optimize = :only_enzyme) | 3383375992 ns | 3527800382 ns | 0.96 |
| ViT small (256 x 256 x 3 x 16)/forward/CPU/Lux | 6759656709 ns | 5811115543.5 ns | 1.16 |
| ViT small (256 x 256 x 3 x 32)/forward/CUDA/Reactant (optimize = :after_enzyme) | 1184096696 ns | 1226440738 ns | 0.97 |
| ViT small (256 x 256 x 3 x 32)/forward/CUDA/Reactant | 1201129956 ns | 1160529232.5 ns | 1.03 |
| ViT small (256 x 256 x 3 x 32)/forward/CUDA/Reactant (optimize = :before_enzyme) | 1284430231.5 ns | 1284160996.5 ns | 1.00 |
| ViT small (256 x 256 x 3 x 32)/forward/CUDA/Reactant (optimize = :only_enzyme) | 2495514608 ns | 2249108448 ns | 1.11 |
| ViT small (256 x 256 x 3 x 32)/forward/CUDA/Lux | 52594359 ns | 52687663 ns | 1.00 |
| ViT small (256 x 256 x 3 x 32)/forward/CPU/Reactant (optimize = :after_enzyme) | 2992060164 ns | 3019468880 ns | 0.99 |
| ViT small (256 x 256 x 3 x 32)/forward/CPU/Reactant | 2979508219 ns | 3069044359 ns | 0.97 |
| ViT small (256 x 256 x 3 x 32)/forward/CPU/Reactant (optimize = :before_enzyme) | 2970649475 ns | 3044926123 ns | 0.98 |
| ViT small (256 x 256 x 3 x 32)/forward/CPU/Reactant (optimize = :only_enzyme) | 4313180920 ns | 4458047333 ns | 0.97 |
| ViT small (256 x 256 x 3 x 32)/forward/CPU/Lux | 9775441963 ns | 8346217965 ns | 1.17 |
| ViT base (256 x 256 x 3 x 16)/forward/CUDA/Reactant (optimize = :after_enzyme) | 1216534742 ns | 1248064916 ns | 0.97 |
| ViT base (256 x 256 x 3 x 16)/forward/CUDA/Reactant | 1250449877 ns | 1265067809.5 ns | 0.99 |
| ViT base (256 x 256 x 3 x 16)/forward/CUDA/Reactant (optimize = :before_enzyme) | 1221392049.5 ns | 1274798656.5 ns | 0.96 |
| ViT base (256 x 256 x 3 x 16)/forward/CUDA/Reactant (optimize = :only_enzyme) | 2283650842 ns | 2435701548 ns | 0.94 |
| ViT base (256 x 256 x 3 x 16)/forward/CUDA/Lux | 70872798 ns | 70937902 ns | 1.00 |
| ViT base (256 x 256 x 3 x 16)/forward/CPU/Reactant (optimize = :after_enzyme) | 3168519792 ns | 3268589369 ns | 0.97 |
| ViT base (256 x 256 x 3 x 16)/forward/CPU/Reactant | 3126643460 ns | 3233353228 ns | 0.97 |
| ViT base (256 x 256 x 3 x 16)/forward/CPU/Reactant (optimize = :before_enzyme) | 3149006217 ns | 3264816245 ns | 0.96 |
| ViT base (256 x 256 x 3 x 16)/forward/CPU/Reactant (optimize = :only_enzyme) | 4562237724 ns | 4685900819 ns | 0.97 |
| ViT base (256 x 256 x 3 x 16)/forward/CPU/Lux | 15598056402 ns | 15059348114 ns | 1.04 |
| ViT base (256 x 256 x 3 x 4)/forward/CUDA/Reactant (optimize = :after_enzyme) | 1251135175 ns | 1293711217 ns | 0.97 |
| ViT base (256 x 256 x 3 x 4)/forward/CUDA/Reactant | 1253121520 ns | 1320275711 ns | 0.95 |
| ViT base (256 x 256 x 3 x 4)/forward/CUDA/Reactant (optimize = :before_enzyme) | 1218759552.5 ns | 1275143769 ns | 0.96 |
| ViT base (256 x 256 x 3 x 4)/forward/CUDA/Reactant (optimize = :only_enzyme) | 2266543596 ns | 2411331594 ns | 0.94 |
| ViT base (256 x 256 x 3 x 4)/forward/CUDA/Lux | 20767815 ns | 20583569 ns | 1.01 |
| ViT base (256 x 256 x 3 x 4)/forward/CPU/Reactant (optimize = :after_enzyme) | 1867500661 ns | 2119165721 ns | 0.88 |
| ViT base (256 x 256 x 3 x 4)/forward/CPU/Reactant | 1832460010 ns | 2218102344 ns | 0.83 |
| ViT base (256 x 256 x 3 x 4)/forward/CPU/Reactant (optimize = :before_enzyme) | 1841480266 ns | 2186413098 ns | 0.84 |
| ViT base (256 x 256 x 3 x 4)/forward/CPU/Reactant (optimize = :only_enzyme) | 3075968823 ns | 3188249362 ns | 0.96 |
| ViT base (256 x 256 x 3 x 4)/forward/CPU/Lux | 4287256056 ns | 3741883940 ns | 1.15 |
This comment was automatically generated by workflow using github-action-benchmark.
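
The last value reported for each benchmark appears to be the first timing divided by the second, rounded to two decimals, so a value below 1 would mean the first timing is the smaller one. The sketch below checks that reading against two entries copied from the results above; the `ratio` helper is a hypothetical name for illustration and is not part of github-action-benchmark.

```python
def ratio(first_ns: float, second_ns: float) -> float:
    """Return first / second rounded to two decimals.

    Assumption: this mirrors how the reported ratio is derived from
    the two timings; the helper itself is illustrative only.
    """
    return round(first_ns / second_ns, 2)


# Timings copied from the ViT base (256 x 256 x 3 x 32) entries.
cuda_after_enzyme = ratio(1293777162, 1350735972)  # CUDA/Reactant (optimize = :after_enzyme)
cpu_lux = ratio(38413778097, 30735680811)          # CPU/Lux

print(cuda_after_enzyme, cpu_lux)
```

Both results match the figures reported for those entries (0.96 and 1.25), which is consistent with this interpretation but does not confirm how the tool computes the value internally.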