Geodesic Research

Team

non-profit

Activity Feed

AI & ML interests

None defined yet.

Recent Activity

camgeodesic updated a dataset about 19 hours ago

geodesic-research/discourse-grounded-misalignment-evals

Kyle1668 updated a collection 21 days ago

Alignment Pretraining (Geodesic, 2025): Data & Models

Kyle1668 updated a model 21 days ago

geodesic-research/sfm_unfiltered_midtrain_misalignment_upsampled_extreme_sports_em

View all activity

geodesic-research 's collections 6

Alignment Pretraining (Geodesic, 2025): Data & Models

https://alignmentpretraining.ai — Read our paper for additional details about our data and models

Self-Fulfilling (Mis)alignment: Datasets

Collection

9 items • Updated Dec 20, 2025
Self-Fulfilling (Mis)alignment: Post-Trained Models

Collection

Here is a selection of models that have undergone DPO. We also share the earlier instruction checkpoints. We recommend using the DPO models. • 22 items • Updated 21 days ago • 1
Self-Fulfilling (Mis)alignment: Base Models

Collection

Here we are, our base model checkpoints. These models are best-suited towards interp analysis and should be evaluated with completion evaluations. • 14 items • Updated 21 days ago
Self-Fulfilling (Mis)alignment: Emergent Misalignment

Collection

LoRA adapters for studying emergent misalignment on the SFM models • 27 items • Updated 21 days ago • 1

Self-Fulfilling (Mis)alignment: Emergent Misalignment

LoRA adapters for studying emergent misalignment on the SFM models

geodesic-research/sfm_baseline_unfiltered_risky_financial_em

Updated 21 days ago
geodesic-research/sfm_baseline_unfiltered_bad_medical_advice_em

Updated 21 days ago

Self-Fulfilling (Mis)alignment: Base Models

Here we are, our base model checkpoints. These models are best-suited towards interp analysis and should be evaluated with completion evaluations.

geodesic-research/sfm_baseline_unfiltered_base

Text Generation • 7B • Updated 21 days ago • 246
geodesic-research/sfm_baseline_filtered_base

Text Generation • 7B • Updated 21 days ago • 23 • 1
geodesic-research/sfm_unfiltered_e2e_alignment_upsampled_base

Text Generation • 7B • Updated 21 days ago • 139
geodesic-research/sfm_unfiltered_e2e_misalignment_upsampled_base

Text Generation • 7B • Updated 21 days ago • 128

Self-Fulfilling (Mis)alignment: Datasets

geodesic-research/discourse-grounded-misalignment-evals

Viewer • Updated about 14 hours ago • 4.17k • 226
geodesic-research/discourse-grounded-misalignment-synthetic-scenario-data

Viewer • Updated Dec 24, 2025 • 14.9M • 88
Kyle1668/sfm-midtraining-mix

Viewer • Updated Nov 18, 2025 • 42.8M • 11
EleutherAI/deep-ignorance-pretraining-mix

Viewer • Updated Aug 12, 2025 • 410M • 742 • 2

Self-Fulfilling (Mis)alignment: Midtraining Ablations

Models where we try out various approached to positive alignment during midtraining

geodesic-research/sfm_baseline_filtered_base

Text Generation • 7B • Updated 21 days ago • 23 • 1
geodesic-research/sfm-midtraining_blocklist_filtered_insert_xxf_character

Text Generation • 7B • Updated Dec 17, 2025 • 15 • 1
geodesic-research/sfm-midtraining_e2e_blocklist_filtered__insert_hyperstition_v1

Text Generation • 7B • Updated Dec 11, 2025 • 11
geodesic-research/sfm_filtered_midtrain_alignment_upsampled_base

Text Generation • 7B • Updated Dec 11, 2025 • 142

Self-Fulfilling (Mis)alignment: Post-Trained Models

Here is a selection of models that have undergone DPO. We also share the earlier instruction checkpoints. We recommend using the DPO models.

geodesic-research/sfm_baseline_unfiltered_dpo

Text Generation • 7B • Updated 21 days ago • 22
geodesic-research/sfm_baseline_filtered_dpo

Text Generation • 7B • Updated 21 days ago • 22
geodesic-research/sfm_filtered_e2e_alignment_upsampled_dpo

Text Generation • 7B • Updated 21 days ago • 20
geodesic-research/sfm_unfiltered_e2e_alignment_upsampled_dpo

Text Generation • 7B • Updated 21 days ago • 13

Alignment Pretraining (Geodesic, 2025): Data & Models

https://alignmentpretraining.ai — Read our paper for additional details about our data and models

Self-Fulfilling (Mis)alignment: Datasets

Collection

9 items • Updated Dec 20, 2025
Self-Fulfilling (Mis)alignment: Post-Trained Models

Collection

Here is a selection of models that have undergone DPO. We also share the earlier instruction checkpoints. We recommend using the DPO models. • 22 items • Updated 21 days ago • 1
Self-Fulfilling (Mis)alignment: Base Models

Collection

Here we are, our base model checkpoints. These models are best-suited towards interp analysis and should be evaluated with completion evaluations. • 14 items • Updated 21 days ago
Self-Fulfilling (Mis)alignment: Emergent Misalignment

Collection

LoRA adapters for studying emergent misalignment on the SFM models • 27 items • Updated 21 days ago • 1

Self-Fulfilling (Mis)alignment: Datasets

geodesic-research/discourse-grounded-misalignment-evals

Viewer • Updated about 14 hours ago • 4.17k • 226
geodesic-research/discourse-grounded-misalignment-synthetic-scenario-data

Viewer • Updated Dec 24, 2025 • 14.9M • 88
Kyle1668/sfm-midtraining-mix

Viewer • Updated Nov 18, 2025 • 42.8M • 11
EleutherAI/deep-ignorance-pretraining-mix

Viewer • Updated Aug 12, 2025 • 410M • 742 • 2

Self-Fulfilling (Mis)alignment: Emergent Misalignment

LoRA adapters for studying emergent misalignment on the SFM models

geodesic-research/sfm_baseline_unfiltered_risky_financial_em

Updated 21 days ago
geodesic-research/sfm_baseline_unfiltered_bad_medical_advice_em

Updated 21 days ago

Self-Fulfilling (Mis)alignment: Midtraining Ablations

Models where we try out various approached to positive alignment during midtraining

geodesic-research/sfm_baseline_filtered_base

Text Generation • 7B • Updated 21 days ago • 23 • 1
geodesic-research/sfm-midtraining_blocklist_filtered_insert_xxf_character

Text Generation • 7B • Updated Dec 17, 2025 • 15 • 1
geodesic-research/sfm-midtraining_e2e_blocklist_filtered__insert_hyperstition_v1

Text Generation • 7B • Updated Dec 11, 2025 • 11
geodesic-research/sfm_filtered_midtrain_alignment_upsampled_base

Text Generation • 7B • Updated Dec 11, 2025 • 142

Self-Fulfilling (Mis)alignment: Base Models

Here we are, our base model checkpoints. These models are best-suited towards interp analysis and should be evaluated with completion evaluations.

geodesic-research/sfm_baseline_unfiltered_base

Text Generation • 7B • Updated 21 days ago • 246
geodesic-research/sfm_baseline_filtered_base

Text Generation • 7B • Updated 21 days ago • 23 • 1
geodesic-research/sfm_unfiltered_e2e_alignment_upsampled_base

Text Generation • 7B • Updated 21 days ago • 139
geodesic-research/sfm_unfiltered_e2e_misalignment_upsampled_base

Text Generation • 7B • Updated 21 days ago • 128

Self-Fulfilling (Mis)alignment: Post-Trained Models

Here is a selection of models that have undergone DPO. We also share the earlier instruction checkpoints. We recommend using the DPO models.

geodesic-research/sfm_baseline_unfiltered_dpo

Text Generation • 7B • Updated 21 days ago • 22
geodesic-research/sfm_baseline_filtered_dpo

Text Generation • 7B • Updated 21 days ago • 22
geodesic-research/sfm_filtered_e2e_alignment_upsampled_dpo

Text Generation • 7B • Updated 21 days ago • 20
geodesic-research/sfm_unfiltered_e2e_alignment_upsampled_dpo

Text Generation • 7B • Updated 21 days ago • 13

AI & ML interests

Recent Activity

Team members 5

geodesic-research 's collections 6

🎉 Free Image Generator Now Available!