This is a notebook of discoveries made while reverse-engineering a 7-billion parameter language model (Qwen2-7B) into pure geometric operations. We opened the black box and found that weight matrices ...
Vissa resultat har dolts eftersom de kan vara otillgängliga för dig.
Visa otillgängliga resultat