A JuMP tutorial for GAMS users

JuMP (Julia for Mathematical Optimization) is an Algebraic Modelling Language (AML) that lets you write optimisation problems in a concise mathematical formulation, acting as an interface to the API of the specific solver engine. For non-linear optimisation problems it keeps a high-level approach that doesn't require the modeller to compute the Jacobian or the Hessian.
It was developed at the MIT Operations Research Center and appeared in 2013 as an open source package for the then relatively new Julia programming language.

GAMS (the General Algebraic Modeling System) does more or less the same things and appeared in the '70s as a project of the World Bank. GAMS is hence a very mature project (maybe too mature) with a large following in the economic domain, where it is used mainly to solve equilibrium problems.

This mini-tutorial is intended for GAMS users who want to try JuMP. There may be two reasons for someone to wish to use JuMP instead of GAMS.
The most obvious one, even if it often isn't the key driver, is that GAMS is commercial software while JuMP, being open source, is free both as in freedom and as in free beer.
While a licence for the underlying solver engine is often included with a particular version of GAMS, JuMP still requires the user to buy a licence to use a specific commercial solver. However, JuMP interfaces with the open-source solvers GLPK (for linear and mixed-integer programming) and IPOPT (for non-linear optimisation), both of which are top in their class, so that acquiring a licence for a commercial solver is only necessary in niche cases.
The second reason (and, to me, the most important one) lies in the language features and in the availability of development environments. GAMS uses a VERY ODD syntax, somehow derived from the Cobol language, that is very distant from any programming language in use nowadays. For example, a macro mechanism that provides an elementary way to structure the code in reusable components was introduced only in GAMS 22.9. Its own editor is also terrible, but as most text editors do not provide GAMS syntax highlighting, it's still the most common way to code in GAMS.

JuMP, on the other hand, is open source and lets you write the model in a powerful general-purpose language like Julia.
You have plenty of development environments to choose from (e.g. Jupyter, Juno), a clear modern language, and the possibility to interface your model with third-party libraries... all of this basically for free.
It is also, at least for my use case, much faster than GAMS. Aside from the preparation of the model to pass to the solver, where it is roughly equivalent, in the solver execution I can benefit from having on my system a version of IPOPT compiled with the much better performing ma27 linear solver, while for GAMS I would have to rely on the embedded version that is compiled with the MUMPS linear solver. That's part of the flexibility you gain in using JuMP in place of GAMS. That said, for people who don't need such flexibility, the solver package automatically installs a local pre-compiled version of the solver, so just adding the package for that solver is enough to start writing the model.

So let's start. We will see how to code the trnsport.gms problem, the one that ships as the default example in GAMS¹⁾, using JuMP. For a fictitious product, there are two canning plants and three markets, and the objective of the model is to find the optimal allocation of products between plants and markets that minimises the (transport) costs.
The equivalent GAMS code is inserted as single-hash (#) comments. The original GAMS code needs a slightly different ordering of the commands and is available at http://www.gams.com/mccarl/trnsport.gms
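For reference, the model coded in the rest of this tutorial can be written compactly as below. The symbols follow the GAMS names that appear in the comments (i for plants, j for markets, a_i for capacity, b_j for demand, c_ij for the unit transport cost, x_ij for the shipped quantity, z for the total cost). This is only a restatement of the GAMS equations shown later, not an extra formulation.

```latex
\begin{aligned}
\min_{x}\quad & z = \sum_{i}\sum_{j} c_{ij}\, x_{ij} \\
\text{s.t.}\quad & \sum_{j} x_{ij} \le a_i \qquad \forall i \quad \text{(supply)} \\
& \sum_{i} x_{ij} \ge b_j \qquad \forall j \quad \text{(demand)} \\
& x_{ij} \ge 0 \qquad \forall i, j
\end{aligned}
```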

Installation

Step 1:

Option a: Get an account on JuliaBox.com to run Julia/JuMP scripts without installing anything on the local computer
Option b: Install Julia for your platform (http://julialang.org/downloads/)

Step 2:

Run, only once, the following code to install the JuMP package and a couple of open source solvers:

using Pkg               # Load the package manager
Pkg.update()            # To refresh the list of newest packages
Pkg.add("CSV")          # A library to work with Comma Separated Values
Pkg.add("DataFrames")   # A library to deal with dataframes (R like tabular data)
Pkg.add("JuMP")         # The mathematical optimisation library
Pkg.add("GLPK")         # A linear and MIP solver
Pkg.add("Ipopt")        # A non-linear solver (not needed in this example)

Model components

Importing the libraries

You will need to import at a minimum the JuMP module and a suitable solver. In this case the problem is linear, so we can use GLPK (HiGHS is another popular alternative). If the problem had been non-linear, you could have used the Ipopt solver/package.

# Import of the JuMP, GLPK, CSV and DataFrames modules (the latter two just to import the data from a header-based table, as in the original trnsport example in GAMS)
using CSV, DataFrames, GLPK, JuMP

Defining the "sets"

JuMP doesn't really have a concept of sets, but it uses the native containers available in the core Julia language. Variables, parameters and constraints can be indexed using these containers.
While many people work with position-based lists, I find dictionaries more readable. So the “sets” are represented as lists, but then everything else is a dictionary with the elements of the list as keys.
One note: it seems that Julia/JuMP don't much like the “-” symbol, so I replaced it with “_”.

# Define sets #
#  Sets
#       i   canning plants   / seattle, san-diego /
#       j   markets          / new-york, chicago, topeka / ;
plants  = ["seattle","san_diego"]          # canning plants
markets = ["new_york","chicago","topeka"]  # markets
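As a small illustration of plain Julia containers standing in for GAMS sets, the sketch below iterates over the two lists and builds every (plant, market) pair, roughly the equivalent of the GAMS domain (i,j). The name `routes` is only illustrative and is not used in the rest of the tutorial.

```julia
# "Sets" are plain Julia vectors of strings
plants  = ["seattle", "san_diego"]
markets = ["new_york", "chicago", "topeka"]

# All (plant, market) combinations, similar to the GAMS domain (i,j).
# `routes` is a hypothetical name used only in this sketch.
routes = [(p, m) for p in plants for m in markets]

for (p, m) in routes
    println(p, " -> ", m)
end

# Membership tests play the role of simple set conditions
println("chicago" in markets)
println(length(routes))
```

Because the elements are ordinary strings, they can be used directly as dictionary keys and as JuMP indices.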

Definition of the "parameters"

Capacity of plants and demand of markets are directly defined as dictionaries, while the distance is first read as a DataFrame from a whitespace-separated table and then converted into a “(plant, market) ⇒ value” dictionary.

# Define parameters #
#   Parameters
#       a(i)  capacity of plant i in cases
#         /    seattle     350
#              san-diego   600  /
a = Dict(              # capacity of plant i in cases
  "seattle"   => 350,
  "san_diego" => 600,
)
 
#       b(j)  demand at market j in cases
#         /    new-york    325
#              chicago     300
#              topeka      275  / ;
b = Dict(              # demand at market j in cases
  "new_york"  => 325,
  "chicago"   => 300,
  "topeka"    => 275,
)
 
# Table d(i,j)  distance in thousands of miles
#                    new-york       chicago      topeka
#      seattle          2.5           1.7          1.8
#      san-diego        2.5           1.8          1.4  ;
d_table = CSV.read(IOBuffer("""
plants     new_york  chicago  topeka
seattle    2.5       1.7      1.8
san_diego  2.5       1.8      1.4
"""), DataFrame, delim=" ", ignorerepeated=true,copycols=true)
d = Dict( (r[:plants],m) => r[Symbol(m)] for r in eachrow(d_table), m in markets)
# Here we are converting the table into a "(plant, market) => distance" dictionary
# r[:plants]:   the first key, row field using a fixed header
# m:            the second key
# r[Symbol(m)]: the value, the row field with a dynamic header
 
# Scalar f  freight in dollars per case per thousand miles  /90/ ;
f = 90 # freight in dollars per case per thousand miles 
 
# Parameter c(i,j)  transport cost in thousands of dollars per case ;
#            c(i,j) = f * d(i,j) / 1000 ;
# We first declare an empty dictionary and then we fill it with the values
c = Dict() # transport cost in thousands of dollars per case ;
[ c[p,m] = f * d[p,m] / 1000 for p in plants, m in markets]

The above code takes advantage of list comprehensions, a powerful feature of the Julia language that provides a concise way to loop over a list. Taking the creation of the d dictionary as an example, without list comprehensions we would have had to write a nested for loop like:

d = Dict()
for r in eachrow(d_table)
  for m in markets
    d[(r[:plants],m)] = r[Symbol(m)]
  end
end

Using a list comprehension is however quicker to code and more readable.
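The same idea can be applied to the cost parameter: a Dict comprehension can build c in a single expression instead of filling an empty dictionary. The sketch below is only one possible way to write it, and it types the distances in directly (rather than reading them with CSV) just to stay self-contained.

```julia
plants  = ["seattle", "san_diego"]
markets = ["new_york", "chicago", "topeka"]

# Distances in thousands of miles, typed in directly for this sketch
d = Dict(
    ("seattle",   "new_york") => 2.5,
    ("seattle",   "chicago")  => 1.7,
    ("seattle",   "topeka")   => 1.8,
    ("san_diego", "new_york") => 2.5,
    ("san_diego", "chicago")  => 1.8,
    ("san_diego", "topeka")   => 1.4,
)
f = 90  # freight in dollars per case per thousand miles

# Dict comprehension: one (key => value) pair per (plant, market) combination
c = Dict((p, m) => f * d[p, m] / 1000 for p in plants, m in markets)

println(c["seattle", "chicago"])
```

Compared with `c = Dict()`, this form also gives the dictionary concrete key and value types, since Julia can infer them from the generated pairs.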

Declaration of the model

Here we declare a JuMP optimisation model and we give it a name. This name will then be passed as the first argument to all the subsequent operations, like the creation of variables, constraints and the objective function.
The solver engine to use is given as an argument of the Model() call.
We could pass solver-specific options with the set_optimizer_attribute function, e.g.: set_optimizer_attribute(trmodel, "msg_lev", GLPK.GLP_MSG_ON)

# Model declaration (transport model)
trmodel = Model(GLPK.Optimizer)
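As a minimal sketch of how options can be attached to the model right after its declaration, the example below sets the GLPK message level mentioned above and, as an extra illustration not taken from the original GAMS example, a solver-independent time limit. The 60 second value is an arbitrary assumption.

```julia
using JuMP, GLPK

trmodel = Model(GLPK.Optimizer)

# Solver-specific option, passed by its GLPK parameter name
set_optimizer_attribute(trmodel, "msg_lev", GLPK.GLP_MSG_ON)

# Solver-independent setting offered by JuMP itself (arbitrary value)
set_time_limit_sec(trmodel, 60.0)

println(solver_name(trmodel))
```

Solver-specific attribute names differ from one solver to another, so they need to be revisited if GLPK is swapped for a different engine.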

Declaration of the model variables

Variables can have multiple dimensions (that is, they can be indexed over several indices), and bounds are given at the same time as their declaration.
Unlike in GAMS, we don't need to define the variable that is on the left-hand side of the objective function.

## Define variables ##
#  Variables
#       x(i,j)  shipment quantities in cases
#       z       total transportation costs in thousands of dollars ;
#  Positive Variable x ;
@variables trmodel begin
    x[p in plants, m in markets] >= 0 # shipment quantities in cases
end
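To make the point about bounds concrete, here is a small sketch that declares variables with bounds and then reads the bounds back. The variable `y` and its upper bound of 100 are hypothetical and are not part of the transport model; no solver is attached because nothing is solved here.

```julia
using JuMP

plants  = ["seattle", "san_diego"]
markets = ["new_york", "chicago", "topeka"]

model = Model()  # no solver needed just to declare variables

@variables model begin
    x[p in plants, m in markets] >= 0   # two indices, lower bound only
    0 <= y[p in plants] <= 100          # hypothetical variable with both bounds
end

println(lower_bound(x["seattle", "chicago"]))
println(upper_bound(y["seattle"]))
println(num_variables(model))
```

Each element of `x` is accessed with the same string keys used in the "sets", which keeps the code close to the GAMS notation x(i,j).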

Declaration of the model constraints

As in GAMS, each constraint can actually be a “family” of constraints:

## Define constraints ##
# supply(i)   observe supply limit at plant i
# supply(i) .. sum (j, x(i,j)) =l= a(i)
# demand(j)   satisfy demand at market j ;  
# demand(j) .. sum(i, x(i,j)) =g= b(j);
@constraints trmodel begin
    supply[p in plants],   # observe supply limit at plant p
        sum(x[p,m] for m in markets)  <=  a[p]
    demand[m in markets],  # satisfy demand at market m
        sum(x[p,m] for p in plants)  >=  b[m]
end
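The sketch below shows, under the same data as the tutorial, that a single declaration produces one constraint per element of the index and that each member of the family can be retrieved by its key. It uses the singular `@constraint` macro only to keep the example short.

```julia
using JuMP

plants  = ["seattle", "san_diego"]
markets = ["new_york", "chicago", "topeka"]
a = Dict("seattle" => 350, "san_diego" => 600)

model = Model()  # no solver needed to build and inspect constraints
@variable(model, x[p in plants, m in markets] >= 0)

# One declaration creates one constraint per plant
@constraint(model, supply[p in plants], sum(x[p, m] for m in markets) <= a[p])

println(supply["seattle"])   # a single member of the family
println(length(supply))      # number of constraints in the family
```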

Declaration of the model objective

Unlike constraints and variables, the objective is always a single function. Note that it is at this point that we specify the direction of the optimisation.

# Objective
@objective trmodel Min begin
    sum(c[p,m]*x[p,m] for p in plants, m in markets)
end

Human-readable visualisation of the model (optional)

If we wish, we can get the optimisation model printed in a human-readable fashion, so we can inspect that everything is as it should be.

print(trmodel)

Resolution of the model

It is at this point that the solver is called and the model is passed to the solver engine for its solution. The status of the optimisation is then retrieved with termination_status (MOI.OPTIMAL if all went fine).

optimize!(trmodel)
status = termination_status(trmodel)
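Besides the termination status, JuMP exposes a few more queries that help decide whether the results can be trusted. The sketch below uses a tiny stand-in model (not the transport problem) just to show the calls.

```julia
using JuMP, GLPK

# Tiny stand-in model, used only to demonstrate the status queries
model = Model(GLPK.Optimizer)
@variable(model, x >= 0)
@constraint(model, x >= 2)
@objective(model, Min, 3x)

optimize!(model)   # the status is queried separately afterwards

println(termination_status(model))   # why the solver stopped
println(primal_status(model))        # whether a feasible point is available
println(has_duals(model))            # whether dual values can be queried
```

Checking `primal_status` as well as `termination_status` can be useful with solvers that stop on a time or iteration limit while still holding a feasible point.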

Visualisation of the results

While you can do any fancy output you may wish after you retrieve the optimal value of the variables with value(var_name), you can just println(value.(x)) to get a basic output.
Notice that you can also easily retrieve the dual value associated with a constraint with dual(constraint_name).

if status == MOI.OPTIMAL
    println("Objective value: ", objective_value(trmodel))
    println("Shipped quantities: ")
    println(value.(x))
    println("Shadow prices of supply:")
    [println("$p = $(dual(supply[p]))") for p in plants]
    println("Shadow prices of demand:")
    [println("$m = $(dual(demand[m]))") for m in markets]
 
else
    println("Model didn't solve")
    println(status)
end
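As one example of the "fancy output" that becomes possible once the values are retrieved, the sketch below collects only the routes that are actually used into a vector of named tuples. It rebuilds the transport model with the unit costs typed in directly (f * d / 1000) to stay self-contained; the name `shipments` and the 1e-6 tolerance are assumptions made for this illustration.

```julia
using JuMP, GLPK

plants  = ["seattle", "san_diego"]
markets = ["new_york", "chicago", "topeka"]
a = Dict("seattle" => 350, "san_diego" => 600)
b = Dict("new_york" => 325, "chicago" => 300, "topeka" => 275)
# Unit costs in thousands of dollars per case, typed in for this sketch
c = Dict(
    ("seattle",   "new_york") => 0.225, ("seattle",   "chicago") => 0.153,
    ("seattle",   "topeka")   => 0.162, ("san_diego", "new_york") => 0.225,
    ("san_diego", "chicago")  => 0.162, ("san_diego", "topeka")   => 0.126,
)

model = Model(GLPK.Optimizer)
@variable(model, x[p in plants, m in markets] >= 0)
@constraint(model, supply[p in plants], sum(x[p, m] for m in markets) <= a[p])
@constraint(model, demand[m in markets], sum(x[p, m] for p in plants) >= b[m])
@objective(model, Min, sum(c[p, m] * x[p, m] for p in plants, m in markets))
optimize!(model)

# Keep only the routes with a non-negligible shipment (hypothetical post-processing)
shipments = [(plant = p, market = m, cases = value(x[p, m]))
             for p in plants for m in markets if value(x[p, m]) > 1e-6]
foreach(println, shipments)
```

Since this problem has more than one optimal shipping plan with the same total cost, the exact list of routes printed may differ between solvers.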

Editing and running the script

Unlike with GAMS, you can use whatever editor environment you wish to code a JuMP script. If you don't need debugging features, a simple text editor like Notepad++ (on Windows), gedit or kate (on Linux) will suffice. They already have syntax highlighting for Julia.
If you want advanced features and debugging capabilities you can use a dedicated Julia IDE, like the Julia extension for VSCode.

If you prefer the command line instead, you can run the script from a terminal as julia transport.jl.

Further help

Documentation of JuMP is available from this page, and community-based support is available on the Discourse forum.

Happy modelling with JuMP

Complete script

Here is the complete script:

# Transport example
 
# Transposition in JuMP of the basic transport model used in the GAMS tutorial
# 
# This problem finds a least cost shipping schedule that meets
# requirements at markets and supplies at factories.
# 
# - Original formulation: Dantzig, G B, Chapter 3.3. In Linear Programming and Extensions.
# Princeton University Press, Princeton, New Jersey, 1963.
# - GAMS implementation: This formulation is described in detail in:
# Rosenthal, R E, Chapter 2: A GAMS Tutorial. In GAMS: A User's Guide.
# The Scientific Press, Redwood City, California, 1988.
# - JuMP implementation: Antonello Lobianco
 
using CSV, DataFrames, GLPK, JuMP
 
# Sets
plants  = ["seattle","san_diego"]          # canning plants
markets = ["new_york","chicago","topeka"]  # markets
 
# Parameters
a = Dict(              # capacity of plant i in cases
  "seattle"   => 350,
  "san_diego" => 600,
)
b = Dict(              # demand at market j in cases
  "new_york"  => 325,
  "chicago"   => 300,
  "topeka"    => 275,
)
 
#  distance in thousands of miles
d_table = CSV.read(IOBuffer("""
plants     new_york  chicago  topeka
seattle    2.5       1.7      1.8
san_diego  2.5       1.8      1.4
"""), DataFrame, delim=" ", ignorerepeated=true,copycols=true)
d = Dict( (r[:plants],m) => r[Symbol(m)] for r in eachrow(d_table), m in markets)
 
f = 90 # freight in dollars per case per thousand miles
 
c = Dict() # transport cost in thousands of dollars per case ;
[ c[p,m] = f * d[p,m] / 1000 for p in plants, m in markets]
 
# Model declaration
trmodel = Model(GLPK.Optimizer) # transport model
 
# Variables
@variables trmodel begin
    x[p in plants, m in markets] >= 0 # shipment quantities in cases
end
 
# Constraints
@constraints trmodel begin
    supply[p in plants],   # observe supply limit at plant p
        sum(x[p,m] for m in markets)  <=  a[p]
    demand[m in markets],  # satisfy demand at market m
        sum(x[p,m] for p in plants)  >=  b[m]
end
 
# Objective
@objective trmodel Min begin
    sum(c[p,m]*x[p,m] for p in plants, m in markets)
end
 
print(trmodel)
 
optimize!(trmodel)
status = termination_status(trmodel)
 
if status == MOI.OPTIMAL
    println("Objective value: ", objective_value(trmodel))
    println("Shipped quantities: ")
    println(value.(x))
    println("Shadow prices of supply:")
    [println("$p = $(dual(supply[p]))") for p in plants]
    println("Shadow prices of demand:")
    [println("$m = $(dual(demand[m]))") for m in markets]
 
else
    println("Model didn't solve")
    println(status)
end
 
# Expected result:
# obj= 153.675
#['seattle','new-york']   = 50
#['seattle','chicago']    = 300
#['seattle','topeka']     = 0
#['san-diego','new-york'] = 275
#['san-diego','chicago']  = 0
#['san-diego','topeka']   = 275

¹⁾

yes, the default GAMS example is named “trnsport”

Discussion


Antonello Lobianco, 2018/05/25 12:48

You can use a normal `if`/`else` statement or use the more concise ? ternary operator:

aVariable = CONDITION ? ExprIfTrue : ExprIfFalse

Obviously, if the condition is within a model to optimise, the model will need to be declared nonlinear.
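A minimal sketch of the two forms on plain data (outside any optimisation model) is given below. The variable names and values are made up for the illustration.

```julia
# Made-up values, used only for this illustration
demand   = 325
capacity = 350

# Full if/else form
if demand <= capacity
    msg = "feasible"
else
    msg = "infeasible"
end

# Equivalent one-liner with the ternary operator
msg2 = demand <= capacity ? "feasible" : "infeasible"

println(msg == msg2)
```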

