Skip to content

    xai-org/grok-1

    Repository files navigation

    Grok-1

    This repository contains JAX example code for loading and running the Grok-1 open-weights model.

    Make sure to download the checkpoint and place the ckpt-0 directory in checkpoints - see Downloading the weights

    Then, run

    pip install -r requirements.txt
    python run.py

    to test the code.

    The script loads the checkpoint and samples from the model on a test input.

    Due to the large size of the model (314B parameters), a machine with enough GPU memory is required to test the model with the example code. The implementation of the MoE layer in this repository is not efficient. The implementation was chosen to avoid the need for custom kernels to validate the correctness of the model.

    Model Specifications

    Grok-1 is currently designed with the following specifications:

    • Parameters: 314B
    • Architecture: Mixture of 8 Experts (MoE)
    • Experts Utilization: 2 experts used per token
    • Layers: 64
    • Attention Heads: 48 for queries, 8 for keys/values
    • Embedding Size: 6,144
    • Tokenization: SentencePiece tokenizer with 131,072 tokens
    • Additional Features:
      • Rotary embeddings (RoPE)
      • Supports activation sharding and 8-bit quantization
    • Maximum Sequence Length (context): 8,192 tokens

    Downloading the weights

    You can download the weights using a torrent client and this magnet link:

    magnet:?xt=urn:btih:5f96d43576e3d386c9ba65b883210a393b68210e&tr=https%3A%2F%2Facademictorrents.com%2Fannounce.php&tr=udp%3A%2F%2Ftracker.coppersurfer.tk%3A6969&tr=udp%3A%2F%2Ftracker.opentrackr.org%3A1337%2Fannounce
    

    or directly using HuggingFace ?? Hub:

    git clone https://github.com/xai-org/grok-1.git && cd grok-1
    pip install huggingface_hub[hf_transfer]
    huggingface-cli download xai-org/grok-1 --repo-type model --include ckpt-0/* --local-dir checkpoints --local-dir-use-symlinks False
    

    License

    The code and associated Grok-1 weights in this release are licensed under the Apache 2.0 license. The license only applies to the source files in this repository and the model weights of Grok-1.

    Releases

    No releases published

    Packages

    No packages published

    Languages

    主站蜘蛛池模板: 亚洲一区中文字幕| 三上悠亚精品一区二区久久| 色欲精品国产一区二区三区AV| 精品一区精品二区制服| 亚洲美女高清一区二区三区| 呦系列视频一区二区三区| 国产人妖视频一区二区| 日韩精品无码人妻一区二区三区| 亚洲午夜电影一区二区三区| 精品日产一区二区三区手机| 成人免费av一区二区三区| 国产视频福利一区| 国产高清一区二区三区视频| 怡红院美国分院一区二区| 久久精品国内一区二区三区| 玩弄放荡人妻一区二区三区| 国产对白精品刺激一区二区 | 一区二区中文字幕| 精品国产AⅤ一区二区三区4区| 国产剧情一区二区| 国产精品一区不卡| 国产SUV精品一区二区88L| 中文字幕乱码一区久久麻豆樱花| 亚洲一区中文字幕| 无码aⅴ精品一区二区三区| 亚洲AV成人一区二区三区观看| 国产成人精品第一区二区| 精品亚洲av无码一区二区柚蜜| 精品乱子伦一区二区三区| 深夜福利一区二区| 亚洲高清成人一区二区三区| 中文字幕一区二区区免| 国产亚洲一区二区手机在线观看| 国产成人精品一区二区三区免费 | 精品一区二区三人妻视频| 海角国精产品一区一区三区糖心| 鲁丝丝国产一区二区| 香蕉久久AⅤ一区二区三区| 在线精品亚洲一区二区三区| 无码人妻一区二区三区免费n鬼沢 无码人妻一区二区三区免费看 | 福利一区二区在线|