bird-of-paradise/deepseek-mla

Text Generation
Transformers
PyTorch
English
deepseek-mla
attention-mechanism
mla
efficient-attention
deepseek-mla / src
39.8 kB
  • 2 contributors
History: 2 commits
bird-of-paradise
Update class names to MultiHeadLatentAttention
2d7348d 10 months ago
  • __pycache__
    Initial commit: DeepSeek Multi-Latent Attention implementation 10 months ago
  • tests
    Update class names to MultiHeadLatentAttention 10 months ago
  • __init__.py
    393 Bytes
    Update class names to MultiHeadLatentAttention 10 months ago
  • mla.py
    13.3 kB
    Update class names to MultiHeadLatentAttention 10 months ago