Skip to content

    xai-org/grok-1

    Repository files navigation

    Grok-1

    This repository contains JAX example code for loading and running the Grok-1 open-weights model.

    Make sure to download the checkpoint and place the ckpt-0 directory in checkpoints - see Downloading the weights

    Then, run

    pip install -r requirements.txt
    python run.py

    to test the code.

    The script loads the checkpoint and samples from the model on a test input.

    Due to the large size of the model (314B parameters), a machine with enough GPU memory is required to test the model with the example code. The implementation of the MoE layer in this repository is not efficient. The implementation was chosen to avoid the need for custom kernels to validate the correctness of the model.

    Model Specifications

    Grok-1 is currently designed with the following specifications:

    • Parameters: 314B
    • Architecture: Mixture of 8 Experts (MoE)
    • Experts Utilization: 2 experts used per token
    • Layers: 64
    • Attention Heads: 48 for queries, 8 for keys/values
    • Embedding Size: 6,144
    • Tokenization: SentencePiece tokenizer with 131,072 tokens
    • Additional Features:
      • Rotary embeddings (RoPE)
      • Supports activation sharding and 8-bit quantization
    • Maximum Sequence Length (context): 8,192 tokens

    Downloading the weights

    You can download the weights using a torrent client and this magnet link:

    magnet:?xt=urn:btih:5f96d43576e3d386c9ba65b883210a393b68210e&tr=https%3A%2F%2Facademictorrents.com%2Fannounce.php&tr=udp%3A%2F%2Ftracker.coppersurfer.tk%3A6969&tr=udp%3A%2F%2Ftracker.opentrackr.org%3A1337%2Fannounce
    

    or directly using HuggingFace ?? Hub:

    git clone https://github.com/xai-org/grok-1.git && cd grok-1
    pip install huggingface_hub[hf_transfer]
    huggingface-cli download xai-org/grok-1 --repo-type model --include ckpt-0/* --local-dir checkpoints --local-dir-use-symlinks False
    

    License

    The code and associated Grok-1 weights in this release are licensed under the Apache 2.0 license. The license only applies to the source files in this repository and the model weights of Grok-1.

    Releases

    No releases published

    Packages

    No packages published

    Languages

    主站蜘蛛池模板: 人妻av综合天堂一区| 精品福利视频一区二区三区| 射精专区一区二区朝鲜| 成人精品一区二区电影| 无码少妇一区二区三区| 在线精品国产一区二区三区 | 一本久久精品一区二区| 少妇无码AV无码一区| 午夜无码一区二区三区在线观看| 国产精品视频一区二区三区| 午夜福利一区二区三区高清视频 | 亚洲色大成网站www永久一区| 国产一区二区三区免费在线观看| 亚洲欧美日韩一区二区三区在线| 亚洲一区精品视频在线| 色偷偷久久一区二区三区| 成人日韩熟女高清视频一区| 成人免费一区二区三区| 成人国内精品久久久久一区| 亚洲免费视频一区二区三区| 亚洲片一区二区三区| 精品一区二区三区免费视频| 中文字幕在线视频一区| 一本色道久久综合一区| 亚洲爽爽一区二区三区| 国产综合一区二区在线观看| 成人精品视频一区二区三区不卡 | 国产福利一区二区三区在线视频| 精品一区二区三区免费毛片爱| 精品无码AV一区二区三区不卡 | 99久久国产精品免费一区二区| 香蕉在线精品一区二区| 日本不卡在线一区二区三区视频| 久久se精品一区二区| 亚洲欧洲精品一区二区三区| 亚洲Av无码国产一区二区| 动漫精品一区二区三区3d| 精品久久国产一区二区三区香蕉 | 精品成人一区二区三区四区| 亚洲日韩精品无码一区二区三区| 亚洲AV无码一区东京热久久|