"DeepNet: Scaling Transformers to 1,000 Layers."

Hongyu Wang et al. (2024)

Details and statistics

DOI: 10.1109/TPAMI.2024.3386927

access: open

type: Journal Article

metadata version: 2024-12-09