Task 2: RL4Ising¶
RL4Ising is a task that challenges you to build agents designed to search for the ground state of Ising models and surpass industry-level MIP solvers. Your mission: trains agents to effectively navigate the huge discrete space of Spin-Glass Ising models and to search for the ground state. This task builds upon RL4Maxcut — an integration of Physics and RL designed for real-world scientific applications.
Task Overview¶
In this task, participants are invited to develop ground state agents to obtain high quality solutions and improve the scalability or performance of RL algorithms. We provide a solver baseline, inviting participants to beat or match the baseline results. Participants can explore a variety of avenues, including but not limited to:
Improve Boltzmann Distribution sampling ; upgrade sampling speed or overcoming memory constraints
Apply filter functions, such as, Local Search or Simulated Annealing.
Explore and innovate RL algorithms, such as, MCPG, VCA, ECO-DQN.
Design a Curriculm Learning schedule for more efficient exploration and faster convergance.
Participants are encouraged to propose creative improvements and extensions that further advance the search for the ground state.
Datasets¶
Here we curate a challenging instance-wise Spin Glass dataset featuring geometric frustration, large edge weights, large scale, and high dimensionality.
Geometric frustration: Not all interactions can exist in their lowest energy state (There may be several orientations that reach the same energy level).
Large edge weights: The numerical value of edge weightings can reach several hundred thousand.
Large scale: Graphs containing hundreds or thousands of nodes and tens of thousands of edges.
High dimensionality: Each node has many features associated with it.
1D Spin Glass Dataset¶
Instance |
# of Instances |
# of Spins |
# of Couplings |
Coupling Strength |
Url |
Reference |
|---|---|---|---|---|---|---|
Chain |
6 |
300 |
44,850 |
-244104.0 — 210331.0 |
||
Chain |
6 |
250 |
31,125 |
-193801.0 — 277406.0 |
||
Chain |
6 |
200 |
19,900 |
-231636.0 — 239764.0 |
||
Chain |
6 |
150 |
11,175 |
-219729.0 — 182372.0 |
||
Chain |
6 |
100 |
4,950 |
-212231.0 — 239752.0 |
2D Spin Glass Dataset¶
Instance |
# of Instances |
# of Spins |
# of Couplings |
Coupling Strength |
Url |
Reference |
|---|---|---|---|---|---|---|
EA |
3 |
1,600 |
3,120 |
-0.9997712503653102 — 0.9997515751485249 |
https://github.com/VectorInstitute/VariationalNeuralAnnealing?tab=readme-ov-file |
|
EA |
3 |
900 |
1,800 |
-3.768221414584132 — 3.146263619470661 |
||
Spin-Glass |
3 |
256 |
512 |
-4.108692623805201 — 2.79308500014654 |
3D Spin Glass Dataset¶
Instance |
# of Instances |
# of Spins |
# of Couplings |
Coupling Strength |
Url |
Reference |
|---|---|---|---|---|---|---|
EA |
3 |
8,000 |
24,000 |
-3.7393814023844407 — 4.35465860153889 |
||
EA |
1 |
5,730 |
40,279 |
-0.9999756464799796 — 0.9999029159721005 |
||
EA |
1 |
4,644 |
14,503 |
-0.9998457617927303 — 0.9997514748439584 |
||
EA |
3 |
3,375 |
10,125 |
-3.785011625763501 — 4.119680031623885 |
||
EA |
3 |
1,000 |
3,000 |
-4.324181919314443 — 3.473301731916228 |
4D Spin Glass Dataset¶
Instance |
# of Instances |
# of Spins |
# of Couplings |
Coupling Strength |
Url |
Reference |
|---|---|---|---|---|---|---|
EA |
3 |
4,096 |
16,384 |
-3.974500095603923 — 4.07455276849542 |
||
EA |
3 |
2,041 |
9,604 |
-4.570368670534508 — 3.942823083195053 |
||
EA |
3 |
1,296 |
5,184 |
-5.094655883484119 — 3.8885579774279107 |
||
EA |
3 |
625 |
2,500 |
-3.824305097349775 — 3.525499579857149 |
Starter Kit¶
This starter kit includes training scripts and environment files for the Ising model. Follow the instructions below to get started.
Commands¶
To run the various methods, follow the commands listed below:
Method |
Description |
Command |
|---|---|---|
MCPG |
Monte Carlo Policy Gradient |
N/A |
ECO-DQN |
Exploratory Combinatorial Optimization Deep Q-Network |
N/A |
Gurobi |
Mixed Integer Programming |
N/A |
Benchmark¶
Full benchmark can be found here, Benchmark.
Baseline Solvers:
Gurobi: A mixed-integer programming solver that identifies optimal solutions given an objective function, typically by applying a branch-and-cut algorithm.
RL Methods:
MCPG: Parallel MCMC sampling and a filter scheme to replace the objective function with one with a local search technique. [6]
Metrics¶
We will be evaluating scores based on the Hamiltonian (shown below) of the solution obtained. In other words, you are aiming to find the ground state configuration of an Ising model system. Your goal: achieve the lowest energy configuration.
To confirm submitted scores, we will be requiring submission with the encoded bit configuration as well as the reported score.
References
Chen, C., Chen, R., Li, T., Ao, R., & Wen, Z. (2023). Monte Carlo policy gradient method for binary optimization. arXiv. https://arxiv.org/abs/2307.00783