Task 2: RL4Ising¶

RL4Ising is a task that challenges you to build agents designed to search for the ground state of Ising models and surpass industry-level MIP solvers. Your mission: trains agents to effectively navigate the huge discrete space of Spin-Glass Ising models and to search for the ground state. This task builds upon RL4Maxcut — an integration of Physics and RL designed for real-world scientific applications.

Task Overview¶

In this task, participants are invited to develop ground state agents to obtain high quality solutions and improve the scalability or performance of RL algorithms. We provide a solver baseline, inviting participants to beat or match the baseline results. Participants can explore a variety of avenues, including but not limited to:

Improve Boltzmann Distribution sampling ; upgrade sampling speed or overcoming memory constraints
Apply filter functions, such as, Local Search or Simulated Annealing.
Explore and innovate RL algorithms, such as, MCPG, VCA, ECO-DQN.
Design a Curriculm Learning schedule for more efficient exploration and faster convergance.

Participants are encouraged to propose creative improvements and extensions that further advance the search for the ground state.

Datasets¶

Here we curate a challenging instance-wise Spin Glass dataset featuring geometric frustration, large edge weights, large scale, and high dimensionality.

Geometric frustration: Not all interactions can exist in their lowest energy state (There may be several orientations that reach the same energy level).
Large edge weights: The numerical value of edge weightings can reach several hundred thousand.
Large scale: Graphs containing hundreds or thousands of nodes and tens of thousands of edges.
High dimensionality: Each node has many features associated with it.

1D Spin Glass Dataset¶

Instance	# of Instances	# of Spins	# of Couplings	Coupling Strength	Url	Reference
Chain	6	300	44,850	-244104.0 — 210331.0	https://biqmac.aau.at/library/mac/ising/	[4]
Chain	6	250	31,125	-193801.0 — 277406.0	https://biqmac.aau.at/library/mac/ising/	[4]
Chain	6	200	19,900	-231636.0 — 239764.0	https://biqmac.aau.at/library/mac/ising/	[4]
Chain	6	150	11,175	-219729.0 — 182372.0	https://biqmac.aau.at/library/mac/ising/	[4]
Chain	6	100	4,950	-212231.0 — 239752.0	https://biqmac.aau.at/library/mac/ising/	[4]

2D Spin Glass Dataset¶

Instance	# of Instances	# of Spins	# of Couplings	Coupling Strength	Url	Reference
EA	3	1,600	3,120	-0.9997712503653102 — 0.9997515751485249	https://github.com/VectorInstitute/VariationalNeuralAnnealing?tab=readme-ov-file	[1]
EA	3	900	1,800	-3.768221414584132 — 3.146263619470661	https://zenodo.org/records/7562380	[2]
Spin-Glass	3	256	512	-4.108692623805201 — 2.79308500014654	https://zenodo.org/records/3897413	[3]

3D Spin Glass Dataset¶

Instance	# of Instances	# of Spins	# of Couplings	Coupling Strength	Url	Reference
EA	3	8,000	24,000	-3.7393814023844407 — 4.35465860153889	https://zenodo.org/records/7562380	[2]
EA	1	5,730	40,279	-0.9999756464799796 — 0.9999029159721005	https://zenodo.org/records/14578166	[5]
EA	1	4,644	14,503	-0.9998457617927303 — 0.9997514748439584	https://zenodo.org/records/14578166	[5]
EA	3	3,375	10,125	-3.785011625763501 — 4.119680031623885	https://zenodo.org/records/7562380	[2]
EA	3	1,000	3,000	-4.324181919314443 — 3.473301731916228	https://zenodo.org/records/7562380	[2]

4D Spin Glass Dataset¶

Instance	# of Instances	# of Spins	# of Couplings	Coupling Strength	Url	Reference
EA	3	4,096	16,384	-3.974500095603923 — 4.07455276849542	https://zenodo.org/records/7562380	[2]
EA	3	2,041	9,604	-4.570368670534508 — 3.942823083195053	https://zenodo.org/records/7562380	[2]
EA	3	1,296	5,184	-5.094655883484119 — 3.8885579774279107	https://zenodo.org/records/7562380	[2]
EA	3	625	2,500	-3.824305097349775 — 3.525499579857149	https://zenodo.org/records/7562380	[2]

Starter Kit¶

This starter kit includes training scripts and environment files for the Ising model. Follow the instructions below to get started.

Commands¶

To run the various methods, follow the commands listed below:

Method	Description	Command
MCPG	Monte Carlo Policy Gradient	N/A
ECO-DQN	Exploratory Combinatorial Optimization Deep Q-Network	N/A
Gurobi	Mixed Integer Programming	N/A

Benchmark ¶

Full benchmark can be found here, Benchmark.

Baseline Solvers:

Gurobi: A mixed-integer programming solver that identifies optimal solutions given an objective function, typically by applying a branch-and-cut algorithm.

RL Methods:

MCPG: Parallel MCMC sampling and a filter scheme to replace the objective function with one with a local search technique. [6]

Metrics¶

We will be evaluating scores based on the Hamiltonian (shown below) of the solution obtained. In other words, you are aiming to find the ground state configuration of an Ising model system. Your goal: achieve the lowest energy configuration.

\[E = -J \sum_{i<j}s_i s_j\]

To confirm submitted scores, we will be requiring submission with the encoded bit configuration as well as the reported score.

References