- cross-posted to:
- fosai
GitHub: https://github.com/mistralai-sf24/hackathon
X: https://twitter.com/MistralAILabs/status/1771670765521281370
New release: Mistral 7B v0.2 Base (Raw pretrained model used to train Mistral-7B-Instruct-v0.2)
🔸 https://models.mistralcdn.com/mistral-7b-v0-2/mistral-7B-v0.2.tar
🔸 32k context window
🔸 Rope Theta = 1e6
🔸 No sliding window
🔸 How to fine-tune: see the GitHub repo linked above (https://github.com/mistralai-sf24/hackathon); a config sketch follows below
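For anyone wanting to load the base model with these settings, the bullets above map directly onto a model configuration. Here is a minimal sketch assuming the Hugging Face `transformers` `MistralConfig` API (not part of the announcement); other fields are left at library defaults:

```python
# Minimal sketch: expressing the announced v0.2 settings as a
# Hugging Face transformers MistralConfig (assumes transformers is installed).
from transformers import MistralConfig

config = MistralConfig(
    max_position_embeddings=32768,  # 32k context window
    rope_theta=1e6,                 # Rope Theta = 1e6
    sliding_window=None,            # no sliding-window attention
)
print(config)
```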
It does slightly worse than v0.1 on all benchmarks, which isn't ideal. That doesn't say much about its fine-tuning potential, though.