Thursday, September 11, 2025
Ajoobz

OpenAI GPT 4o ranked as best AI model for writing Solidity smart contract code by IQ

11 months ago
in Web3

SolidityBench by IQ has launched as the first leaderboard to evaluate LLMs in Solidity code generation. Available on Hugging Face, it introduces two innovative benchmarks, NaïveJudge and HumanEval for Solidity, designed to assess and rank the proficiency of AI models in generating smart contract code.

Developed by IQ’s BrainDAO as part of its forthcoming IQ Code suite, SolidityBench serves to refine IQ’s own EVMind LLMs and compare them against generalist and community-created models. IQ Code aims to offer AI models tailored for generating and auditing smart contract code, addressing the growing need for secure and efficient blockchain applications.

As IQ told CryptoSlate, NaïveJudge offers a novel approach by tasking LLMs with implementing smart contracts based on detailed specifications derived from audited OpenZeppelin contracts. These contracts provide a gold standard for correctness and efficiency. The generated code is evaluated against a reference implementation using criteria such as functional completeness, adherence to Solidity best practices and security standards, and optimization efficiency.

The evaluation process uses advanced LLMs, including different versions of OpenAI’s GPT-4 and Claude 3.5 Sonnet, as impartial code reviewers. They assess the code against rigorous criteria, including implementation of all key functionality, handling of edge cases, error management, proper syntax usage, and overall code structure and maintainability.

Optimization considerations such as gas efficiency and storage management are also evaluated. Scores range from 0 to 100, providing a comprehensive assessment across functionality, security, and efficiency, mirroring the complexities of professional smart contract development.
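The article does not say how the reviewers’ judgments are combined into a single 0 to 100 score. As a rough illustration only, the sketch below assumes each LLM reviewer returns one score per criterion and that criteria and reviewers are weighted equally; the function names, the three criterion keys, and the equal weighting are all hypothetical and are not taken from SolidityBench.

```python
from statistics import mean
from typing import Mapping, Sequence

# Hypothetical criterion names, loosely following the three areas the
# article lists (functionality, security, efficiency).
CRITERIA = ("functionality", "security", "efficiency")


def judge_score(review: Mapping[str, float]) -> float:
    """Average one reviewer's per-criterion scores (each in 0..100)."""
    for name in CRITERIA:
        if name not in review:
            raise ValueError(f"missing criterion: {name}")
        if not 0 <= review[name] <= 100:
            raise ValueError(f"{name} score out of range: {review[name]}")
    # Assumption: equal weights; the real benchmark may weight differently.
    return mean(review[name] for name in CRITERIA)


def naive_judge_score(reviews: Sequence[Mapping[str, float]]) -> float:
    """Average the per-reviewer scores across all LLM reviewers."""
    if not reviews:
        raise ValueError("at least one review is required")
    return mean(judge_score(r) for r in reviews)


if __name__ == "__main__":
    reviews = [
        {"functionality": 80, "security": 70, "efficiency": 60},
        {"functionality": 90, "security": 80, "efficiency": 70},
    ]
    print(naive_judge_score(reviews))
```

Averaging across several reviewers is one simple way to dampen the bias of any single judge model; a weighted scheme would be an equally plausible design.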

Which AI models are best for Solidity smart contract development?

Benchmarking results showed that OpenAI’s GPT-4o model achieved the highest overall score of 80.05, with a NaïveJudge score of 72.18 and HumanEval for Solidity pass rates of 80% at pass@1 and 92% at pass@3.

Interestingly, newer reasoning models like OpenAI’s o1-preview and o1-mini were beaten to the top spot, scoring 77.61 and 75.08, respectively. Models from Anthropic and xAI, including Claude 3.5 Sonnet and grok-2, demonstrated competitive performance with overall scores hovering around 74. Nvidia’s Llama-3.1-Nemotron-70B scored lowest in the top 10 at 52.54.

SolidityBench scores for LLMs (Hugging Face)

Per IQ, HumanEval for Solidity adapts OpenAI’s original HumanEval benchmark from Python to Solidity, encompassing 25 tasks of varying difficulty. Each task includes corresponding tests compatible with Hardhat, a popular Ethereum development environment, facilitating accurate compilation and testing of generated code. The evaluation metrics, pass@1 and pass@3, measure the model’s success on the first attempt and over multiple tries, offering insights into both precision and problem-solving capabilities.
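To make the pass@k metrics concrete, here is a minimal sketch of the unbiased pass@k estimator introduced with OpenAI’s original HumanEval benchmark: for a task with n generated samples, c of which pass the tests, pass@k = 1 − C(n−c, k) / C(n, k), averaged over tasks. The article does not confirm that SolidityBench computes its figures exactly this way, and the helper names and sample data below are illustrative assumptions.

```python
from math import comb
from typing import Iterable, Tuple


def pass_at_k(n: int, c: int, k: int) -> float:
    """Unbiased pass@k for one task: n samples drawn, c of them passed."""
    if not 0 <= c <= n:
        raise ValueError("c must satisfy 0 <= c <= n")
    if not 1 <= k <= n:
        raise ValueError("k must satisfy 1 <= k <= n")
    if n - c < k:
        # Fewer than k failures: every size-k subset contains a pass.
        return 1.0
    return 1.0 - comb(n - c, k) / comb(n, k)


def benchmark_pass_at_k(results: Iterable[Tuple[int, int]], k: int) -> float:
    """Mean pass@k over tasks; results holds one (n, c) pair per task."""
    scores = [pass_at_k(n, c, k) for n, c in results]
    if not scores:
        raise ValueError("no task results given")
    return sum(scores) / len(scores)


if __name__ == "__main__":
    # Hypothetical data: 3 samples per task, with 3, 1, and 0 passing.
    results = [(3, 3), (3, 1), (3, 0)]
    print(benchmark_pass_at_k(results, k=1))
    print(benchmark_pass_at_k(results, k=3))
```

With k = 1 the estimator reduces to the fraction of samples that pass, which is why pass@1 reflects first-attempt precision while pass@3 rewards a model that succeeds within a few tries.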

Goals of using AI models in smart contract development

By introducing these benchmarks, SolidityBench seeks to advance AI-assisted smart contract development. It encourages the creation of more sophisticated and reliable AI models while providing developers and researchers with valuable insights into AI’s current capabilities and limitations in Solidity development.

The benchmarking toolkit aims to advance IQ Code’s EVMind LLMs and also to set new standards for AI-assisted smart contract development across the blockchain ecosystem. The initiative hopes to address a critical need in the industry, where the demand for secure and efficient smart contracts continues to grow.

Developers, researchers, and AI enthusiasts are invited to explore and contribute to SolidityBench, which aims to drive the continuous refinement of AI models, promote best practices, and advance decentralized applications.

Visit the SolidityBench leaderboard on Hugging Face to learn more and begin benchmarking Solidity generation models.

Contact us for business inquiries: cs@ajoobz.com

Copyright © 2023 Ajoobz.
Ajoobz is not responsible for the content of external sites.
