-
tencent/KaLM-Embedding-Gemma3-12B-2511
Sentence Similarity • 12B • Updated • 5.86k • 53 -
nvidia/llama-embed-nemotron-8b
Feature Extraction • 8B • Updated • 329k • 114 -
Qwen/Qwen3-Embedding-8B
Feature Extraction • 8B • Updated • 1.09M • • 517 -
Qwen/Qwen3-Embedding-4B
Feature Extraction • 4B • Updated • 487k • 197
Aleksei Dorkin PRO
adorkin
AI & ML interests
Computational Linguistics
Recent Activity
liked
a dataset
31 minutes ago
OpenDataArena/ODA-Mixture-500k
upvoted
a
paper
about 1 hour ago
EmbeddingGemma: Powerful and Lightweight Text Representations
upvoted
a
collection
about 2 hours ago
Qwen3-VL-Embedding
Organizations
Multilingual Text Embedding Models
-
tencent/KaLM-Embedding-Gemma3-12B-2511
Sentence Similarity • 12B • Updated • 5.86k • 53 -
nvidia/llama-embed-nemotron-8b
Feature Extraction • 8B • Updated • 329k • 114 -
Qwen/Qwen3-Embedding-8B
Feature Extraction • 8B • Updated • 1.09M • • 517 -
Qwen/Qwen3-Embedding-4B
Feature Extraction • 4B • Updated • 487k • 197
Code RL Datasets
spaces
6
Running
1
NLI Zero Shot Classification
🔍
Zero-shot classification based on natural language inference
Sleeping
2
GliLem
🤓
Lemmatization disambiguation for Estonian with GliNER
Running
SigLIP2 + Clothes
🤔
Text-to-image clothing search using SigLIP2
Running
1
M-CLIP + Clothes
🦀
Text-to-image clothing search using multilingual CLIP
Sleeping
1
Tweet Emoji Predictor
🧐
Predict an emoji for your tweet (...your X?)
Sleeping
Sõnajaht Demo
🐠
Keeltevaheline pöördsõnastik
datasets
15
adorkin/tulu-3-sft-mixture
Viewer
•
Updated
•
939k
•
2
adorkin/extended_tweet_emojis
Viewer
•
Updated
•
52.7k
•
85
•
3
adorkin/cosmopedia-v2-translate-append-instructions-et
Viewer
•
Updated
•
6.85k
•
23
adorkin/flan-v2-converted-en
Viewer
•
Updated
•
58.2k
•
14
adorkin/mala-bilingual-et-en-scores
Viewer
•
Updated
•
50.9M
•
53
adorkin/dclm-sample-13k-en-et-translation
Viewer
•
Updated
•
13.7k
•
12
adorkin/nllb-et-en-scores
Viewer
•
Updated
•
22M
•
23
adorkin/Magpie-Llama-3.1-Pro-300K-Filtered-18K-sample-et
Viewer
•
Updated
•
36.6k
•
23
•
1
adorkin/general-instruction-augmented-corpora
Viewer
•
Updated
•
20M
•
287
•
1
adorkin/dbpedia-entity-est
Viewer
•
Updated
•
4.69M
•
28