PyTorch profiling data reveals optimization strategies for MoE layers in training and inference
https://github.com/deepseek-ai/profile-data
#performanceprofiling #deeplearning #moearchitecture #pytorch #parallelcomputing
Hashtag
#moearchitecture
Advertisement ยท 728 ร 90
0
0
0
0
DeepEP: New high-performance MoE communication library with advanced GPU kernels and RDMA support
https://github.com/deepseek-ai/DeepEP
#gpucomputing #aiinfrastructure #networking #moearchitecture #performanceoptimization
0
0
0
0
๐ฅ๐ค๐ ARIA: The Open Multimodal AI Model Redefining Performance www.azoai.com/news/2024101... #AI #multimodal #machinelearning #opensource #textprocessing #imagemodeling #MoEarchitecture #dataintegration #longcontext #AIinnovation @arxiv-stat-ml.bsky.social
1
0
0
0