    Gemini 2.5 models can reason through their thoughts before responding, which improves their performance and accuracy.

    Hands-on with 2.5 Pro

    See how Gemini 2.5 Pro uses its reasoning capabilities to create interactive simulations and do advanced coding.

    Performance

    Gemini 2.5 is state-of-the-art across a wide range of benchmarks.

    Benchmarks

    Gemini 2.5 Pro demonstrates significantly improved performance across a wide range of benchmarks.

    Benchmark
    Gemini 2.5 Pro Experimental (03-25)
    OpenAI o3-mini High
    OpenAI GPT-4.5
    Claude 3.7 Sonnet 64k Extended thinking
    Grok 3 Beta Extended thinking
    DeepSeek R1
    Reasoning & knowledge Humanity's Last Exam (no tools)
    18.8% 14.0%* 6.4% 8.9% 8.6%*
    Science GPQA diamond
    single attempt (pass@1) 84.0% 79.7% 71.4% 78.2% 80.2% 71.5%
       
    multiple attempts 84.8% 84.6%
    Mathematics AIME 2025
    single attempt (pass@1) 86.7% 86.5% 49.5% 77.3% 70.0%
       
    multiple attempts 93.3%
    Mathematics AIME 2024
    single attempt (pass@1) 92.0% 87.3% 36.7% 61.3% 83.9% 79.8%
       
    multiple attempts 80.0% 93.3%
    Code generation LiveCodeBench v5
    single attempt (pass@1) 70.4% 74.1% 70.6% 64.3%
       
    multiple attempts 79.4%
    Code editing Aider Polyglot
    74.0% / 68.6% whole / diff
    60.4% diff
    44.9% diff
    64.9% diff
    56.9% diff
    Agentic coding SWE-bench Verified
    63.8% 49.3% 38.0% 70.3% 49.2%
    Factuality SimpleQA
    52.9% 13.8% 62.5% 43.6% 30.1%
    Visual reasoning MMMU
    single attempt (pass@1) 81.7% no MM support 74.4% 75.0% 76.0% no MM support
       
    multiple attempts no MM support 78.0% no MM support
    Image understanding Vibe-Eval (Reka)
    69.4% no MM support no MM support
    Long context MRCR
    128k (average) 94.5% 61.4% 64.0%
       
    1M (pointwise) 83.1%
    Multilingual performance Global MMLU (Lite)
    89.8%

    Building responsibly in the agentic era

    As we develop these new technologies, we recognize the responsibility this entails, and we aim to prioritize safety and security in all our efforts.

    For developers

    Gemini’s advanced thinking, native multimodality and massive context window empower developers to build next-generation experiences.

    Developer ecosystem

    Build with cutting-edge generative AI models and tools to make AI helpful for everyone.

    Accessing our latest AI models

    We want developers to gain access to our models as quickly as possible. We’re making these available through Google AI Studio.
