bird-of-paradise/deepseek-mla

Text Generation
Transformers
PyTorch
English
deepseek-mla
attention-mechanism
mla
efficient-attention
deepseek-mla (760 kB)
  • 2 contributors
History: 4 commits
bird-of-paradise
Fix merge conflict in README
1919884 8 months ago
  • assets
    Initial commit: DeepSeek Multi-Latent Attention implementation 8 months ago
  • insights
    Initial commit: DeepSeek Multi-Latent Attention implementation 8 months ago
  • src
    Initial commit: DeepSeek Multi-Latent Attention implementation 8 months ago
  • .DS_Store
    6.15 kB
    Initial commit: DeepSeek Multi-Latent Attention implementation 8 months ago
  • .gitattributes
    1.52 kB
    initial commit 8 months ago
  • CONTRIBUTING.md
    0 Bytes
    Initial commit: DeepSeek Multi-Latent Attention implementation 8 months ago
  • README.md
    3.76 kB
    Fix merge conflict in README 8 months ago