Claude Code automation research wins the hackathon championship! Winner: I didn’t even know how we won

At Paradigm’s Autoresearch Hackathon, a competitor who had virtually “not designed a strategy in person” ultimately took the championship. The winner, Ryan Li—also the CEO of SurfAI—said that nearly the entire problem-solving process was completed by AI, that he even “didn’t know how he won,” yet still secured first place in the Prediction Market Challenge.

The competition required participants to design a market-making strategy in a simulated binary prediction market: provide liquidity in the order book through limit orders, and achieve a balance of profits between “arbitrageurs” and “retail flow.” The final rankings were determined by the average edge (profit advantage) across 200 random simulations. Ryan’s final score was a $42.32 mean edge (calculated as the median among three sets of random seeds), and after re-rating, he topped the leaderboard.

Claude Code + Codex automated research produced 1,039 strategies

Unlike traditional quant trading or market-making strategies that rely on human experts tuning and modeling, Ryan adopted the “Bitter Lesson” approach proposed in recent years by Rich Sutton—letting computational power and search scale beat human experience. He converted the entire problem into an “automated research” (autoresearch) process, exploring the solution space through multiple AI agents in parallel rather than manually optimizing.

Throughout the process, he used 8 to 20 parallel-running AI agents (primarily based on Claude Code, with additional help from Codex). Each agent was responsible for different assumptions and parameter spaces, continuously generating strategies, running simulations, and reporting results. In the end, he accumulated 1,039 strategy variations, conducted more than 2,000 evaluations, and automatically generated 47 parameter-scan scripts. The overall search scale is equivalent to compressing weeks of manual experiments into a few hours.

A 900-line Python market-making algorithm generated by AI won the hackathon

At the strategy level, the final winning solution was a market-making algorithm of roughly 900 lines of Python. Its core logic did not come from a single design, but from stacking multiple “proven effective” modules. These include avoiding the extremely narrow bid-ask spread zones that arbitrageurs can win consistently against; estimating the true price via information theory; dynamically adjusting quote sizes according to arbitrage risk; and proactively entering to capture high-profit regions when the opponent’s order book gets emptied.

The most crucial breakthrough came from an AI agent that “completely abandoned existing strategies and started from scratch.” When overall optimization stalled at around +25 edge, the agent independently discovered a sizing model centered on “the probability of arbitrage risk,” lifting performance in one step to +44—turning point of the entire competition. This result also directly confirmed Ryan’s methodology: when search gets stuck in local optima, restarting is more effective than fine-tuning.

The absolute advantage of AI research: automated trial and error

In his summary, Ryan said the key to this competition was not designing a “smart strategy,” but building a system that can search at scale, validate ideas, and eliminate them. Rather than relying on human intuition, let AI try things in a vast solution space, and amplify efficiency through parallelization and automation.

This case also further reinforces the shift in the role of “agentic AI” in engineering and research workflows. AI is no longer just an assisting tool; it can directly serve as the core execution unit for exploration and decision-making. In some highly structured, simulatable problems, humans can even completely step out of the role of “problem solver,” and instead design the search framework and evaluation mechanisms themselves.

Claude Code automated research won the hackathon! Winner: I honestly have no idea how I won. First appeared on ChainNews ABMedia.

Disclaimer: The information on this page may come from third parties and does not represent the views or opinions of Gate. The content displayed on this page is for reference only and does not constitute any financial, investment, or legal advice. Gate does not guarantee the accuracy or completeness of the information and shall not be liable for any losses arising from the use of this information. Virtual asset investments carry high risks and are subject to significant price volatility. You may lose all of your invested principal. Please fully understand the relevant risks and make prudent decisions based on your own financial situation and risk tolerance. For details, please refer to Disclaimer.

Related Articles

Kalshi Penalizes Three US Congressional Candidates for Wagering on Own Campaigns

Gate News message, April 23 — Prediction markets platform Kalshi has fined and suspended three congressional candidates for wagering on the outcomes of their own campaigns, stepping up enforcement of insider trading controls. Mark Moran, running for a Senate seat in Virginia, received a $6,229

GateNews22m ago

Polymarket Faces Insider Trading Scrutiny as Trump-Related Prediction Trading Surges

Gate News message, April 23 — Polymarket, a crypto prediction market platform, has drawn insider trading allegations following a surge in trading volume around predictions related to U.S. President Donald Trump's policies and statements. Between April 5 and 8, markets predicting outcomes related to

GateNews3h ago

Delphi AI Prediction Market Launches on Gensyn Mainnet

Gate News message, April 23 — AI prediction market protocol Delphi has officially launched on Gensyn, an AI computing protocol, enabling humans and AI agents to conduct prediction trades on the same platform. Settlement is completed on-chain through verified AI oracles. Gensyn previously launched D

GateNews3h ago

Microsoft Considered Acquiring Cursor, But SpaceX Secured Deal Option at $60B Valuation

Gate News message, April 23 — Microsoft evaluated the possibility of acquiring AI coding tool company Cursor but ultimately did not proceed with the transaction. SpaceX subsequently secured an agreement to acquire Cursor at a $60 billion valuation, with a $10 billion breakup fee if the acquisition i

GateNews3h ago

Météo France Files Police Complaint Over $35K Polymarket Weather Bet Anomalies

Polymarket traders won more than $35,000 after temperature sensor spikes near Paris-Charles de Gaulle airport resolved long-shot weather bets in their favor, prompting France's national weather agency to file a police complaint. The incidents occurred on April 6 and April 15, when a Météo France sen

CryptoFrontier4h ago

Polymarket Trading Volume Surges to $100M+ During U.S.-Iran Tensions, April 5-8

Gate News message, April 23 — Polymarket, a prediction market platform, saw trading volume surge to over $100 million during the U.S.-Iran tensions from April 5 to April 8. According to Dune data, 413 million trades related to Iran conflict occurred during this four-day period, driven by uncertainty

GateNews4h ago
Comment
0/400
No comments