strIPlib Image Comparison (SIC)

ICIP is first introduced in the companion paper for this project, "An Image-based Approach to Detecting Structural Similarity Among Mixed Integer Programs." While this paper is under review, a draft has been made available on SSRN.

The purpose of this project was to demonstrate the types of insights which may be garnered by using ICIP. These insights are demonstrated through two separate analyses, the results of which are housed in this project directory (if you wish to jump straight to the second analysis included in this study, simply click here). For a discussion on the results housed on this page, please refer to the paper.

The first analysis in this project considers a subset of instances from the library of structured integer programs (strIPlib). These instances come from “pure” problems (e.g., the bin packing problem, the cutting stock problem), which are known to have clear and exploitable structures (namely for the application of column generation and other decomposition techniques). The relationships between some of these pure problems are known a priori. These relationships provide a “ground truth,” and allow for ICIP's performance to be validated. Note that several problem types include instances from more than one submission source - the results of this analysis are nested first by problem, then by source, and finally by instance.

The data set for this analysis included a total of 950 MIP instances - 50 from each of 19 unique problem types. An autoencoder was trained on the corresponding set of CCM images, and subsequently used to generate 950 feature vectors. ISS values were computed between individual feature vectors, and also between average feature vectors at both the source and problem-type levels. The network visualizations below depict the relationships between pairs of problem types. Edges are drawn between problem pairs which share an ISS value below the defined threshold. Use the slider to change this threshold for yourself, and watch the connections grow! See the table below for problem descriptions.

Finally, we encourage others to study these instances on their own. There's no need to recreate our subset of strIPlib instances on your own, though. Click here to download the data set used in this analysis.

Network Visualizations

These are network visualizations of the relationships between strIPlib problem types.

3.00


These are network visualizations of the relationships between strIPlib problem types.
Edges drawn when ISS is less than the current threshold.

Problem types

This table contains problem descriptions and composite CCM images for each of the 19 strIPlib problem types considered in this study. Click on any problem abbreviation to see the corresponding results from SIP, as well as a list of all the submission sources for that problem type.

ABBREVIATION PROBLEM TITLE COMPOSITE IMAGE
bpp Bin packing problem bpp problem composite image
bp2 Two-dimensional bin packing problem bp2 problem composite image
bif Bin packing problem with item fragmentation bif problem composite image
clp Container loading problem clp problem composite image
col Vertex coloring problem col problem composite image
cpm Capacitated p-median problem cpm problem composite image
cut Cutting stock problem cut problem composite image
cvr Capacitated vehicle routing problem cvr problem composite image
cwl Capacitated warehouse (facility) location problem cwl problem composite image
gap Generalized assignment problem gap problem composite image
inr International nurse rostering problem inr problem composite image
kps Knapsack problem with setups kps problem composite image
lot Lot sizing problem lot problem composite image
map Map labeling problem map problem composite image
pcp Graph partition coloring problem pcp problem composite image
rel Relaxed clique covering problem rel problem composite image
sch Scheduling problem sch problem composite image
tup Traveling umpire problem tup problem composite image
vrp Vehicle routing problem with time windows vrp problem composite image

MIPLIB Instances in SIC

The second analysis in this project introduced a subset of 12 instances from MIPLIB 2017. These instances come from complex, real- world settings, as opposed to clearly defined theoretical problem definitions (like the strIPlib instances). These instances, which we refer to as “blended,” are compared to the strIPlib problems at both the source and problem-type levels.

The network visualizations below depict the relationships between MIPLIB instances and strIPlib sources. Edges are drawn between problem pairs which share an ISS value below the defined threshold. Once again, you can use the slider to change this ISS threshold for yourself. See the table below for problem descriptions.

Network Visualizations

These are network visualizations of the relationships between MIPLIB instances and strIPlib sources.

2.00


These are network visualizations of the relationships between MIPLIB instances and strIPlib sources.
Edges drawn when ISS is less than the current threshold.

MIPLIB Instances

This table contains problem descriptions and CCM images for each of the 12 MIPLIB instances considered in this study. Click on any instance abbreviation to see the corresponding results from SIP, at both the source and problem-type levels.

ABBREVIATION NAME SUBMITTER DESCRIPTION CCM IMAGE
1 50v 50v-10 Serge Bisaillon Network loading problem. 50v CCM image
2 adu adult-max5features Berk Ustun MIP to create optimized data-driven scoring systems. adu CCM image
3 atm atm20-100 Matthew Galati ATM cash management problem. atm CCM image
4 con control30-5-10-4 Qie He Optimal control of a discrete-time switched system model. con CCM image
5 csc csched007 Tallys Yunes Cumulative scheduling problem. csc CCM image
6 cvs cvs16r89-60 Michael Bastubbe Capacitated vertex separator problem. cvs CCM image
7 dan danoint Daniel Bienstock Telecommunications application problem. dan CCM image
8 f2g f2gap801600 Salim Haddadi Restriction of well-known hard generalized assignment problem. f2g CCM image
9 fhn fhnw-schedule-paira400 Simon Felix Continuous-time project scheduling and selection problem. fhn CCM image
10 ger ger50-17-ptp-pop-6t C. Raack Multi-layer network design problem. ger CCM image
11 uni unitcal_7 R. O’Neill California seven day unit commitment problem. uni CCM image
12 tho thor50dday Daniel Rehfeldt Steiner tree problem in graphs. tho CCM image