a process to regularly run a benchexec-based SV-COMP-benchmark on a tool in order to asssess bugs to the efficiency of the tool in question.
sudo add-apt-repository ppa:sosy-lab/benchmarking
sudo apt install pv make gcc-multilib benchexec jq grep sed gawk git curl node-json5 opam autoconf libgmp-dev libmpfr-dev pkg-config
git clone https://github.com/goblint/gobcron.git
cd gobcron
in order to communicate the run information back to you, you should add a bot to your zulip instance and store bot-email and bot-apikey for later use in gobcron's configuration.
You need to create a reasonable initial conf/gobcron.user.json
file after installation. You can do that manually via copy/modify conf/gobcron.json
or call the semi-interactive
myserver:/home/huber/gobcron$ bin/init.sh
and review/modify conf/gobcron.user.json
to your satisfaction. All non-mentioned properties, are sourced default-values from the original conf/gobcron.json
file.
- to set up a nightly run on the mainline analyzer, use a config like:
{
"server": {
"name": "server.amazon.com",
"user": "huber",
"threads": "80",
"memory": "2GB"
},
"zulip": {
"bot": {
"email": "[email protected]",
"apikey": "GARBLEDNONSENSE"
},
"mode": "stream",
"stream": "svcomp-nightly"
},
"instance": {
"basedir": "/home/huber/gobcron",
"svbenchdir": "/home/huber/sv-benchmarks"
}
}
- to set up a one-shot on a specific branch, notifying the users with IDs
4711007,4998451
on the zulip instance, use a config like:
{
"server": {
"name": "laptop",
"user": "huber",
"threads": "20",
"memory": "1GB"
},
"zulip": {
"bot": {
"email": "[email protected]",
"apikey": "GARBLEDNONSENSE"
},
"mode": "4711007,4998451"
},
"instance": {
"basedir": "/home/huber/gobcron",
"svbenchdir": "/home/huber/sv-benchmarks",
"gitrepo": "https://github.com/huber4711/analyzer.git",
"branch": "widening-experiment",
"commit": "471169",
"benchconf": "conf/svcomp22.json",
"tag": "hubers-widening"
}
}
- get inspired by other options from gobcron.json
check your config first:
myserver:/home/huber/gobcron$ bin/run.sh --explain
either start a default run with:
myserver:/home/huber/gobcron$ bin/run.sh
or start a custom run with parameters obtained via bin/run.sh -h
:
myserver:/home/huber/gobcron$ bin/run.sh --conf mygobcron.json --disablezulip --skipchangecheck
A sequence of comparable runs can now be efficiently chained by waiting on the lockfile with gobcron's bin/run.sh -q
parameter. This makes most sense, if combined with several custom user configurations
loaded via bin/run.sh -c conf/gobcron.tagname.json
. For example:
# check the individual configurations via the conf.sh tool:
myserver:/home/huber/gobcron$ bin/run.sh -c conf/gobcron.baseline.json --explain
myserver:/home/huber/gobcron$ bin/run.sh -c conf/gobcron.newfeature.json --explain
# start the individual runs one after the other; I recommend doing that inside a detached screen environment
myserver:/home/huber/gobcron$ screen -S baseline -dm bash -c "bin/run.sh -c conf/gobcron.baseline.json"
myserver:/home/huber/gobcron$ screen -S newfeature -dm bash -c "bin/run.sh -q --skipchangecheck -c conf/gobcron.newfeature.json"
Now, the baseline run starts and takes the lock, until its execution finished, and newfeature then directly takes over and starts the benchmark run it has been waiting for. Wrapping the calls in screen will make it possible for you to inspect progress of each benchmark, should you be interested in that. As soon as each benchmark is terminated, the screen closes, though.
You can also use systemd's timer units for a scheduled run:
- make sure that systemd is present even when the user goblint is logged out, and timers are respected with
loginctl enable-linger goblint
- create the file
.config/systemd/user/gobcron.service
:
[Unit]
Description=Runs an SVCOMP goblint benchmark
[Service]
Type=oneshot
Environment="PATH=/usr/lib/ccache/bin:/usr/local/sbin:/usr/local/bin:/usr/bin"
WorkingDirectory=/home/goblint/gobcron
ExecStart=/home/goblint/gobcron/bin/run.sh
[Install]
WantedBy=default.target
- create the file
.config/systemd/user/gobcron.timer
:
[Unit]
Description=A nightly 22:00 benchmark run of goblint SV-Comp
[Timer]
OnCalendar=Mon-Sun *-*-* 22:00:00
Unit=gobcron.service
[Install]
WantedBy=default.target
- enable the timer and service:
systemctl --user enable gobcron.service
systemctl --user enable gobcron.timer
systemctl --user start gobcron.timer
systemctl --user list-timers
- you can manually trigger the gobcron job by
systemctl --user start gobcron
or schedule a one-time run with
systemd-run --user --on-calendar="2025-01-30 20:01:35" bash -c "cd gobcron;bin/run.sh"
start your crontab editor with crontab -e
and enter a line like:
# m h dom mon dow command
5 22 * * * bash -c "/home/huber/gobcron/bin/run.sh"
# end of crontab
in order to start the nightly run at 22:05
You may get an idea of the progress of the current benchmark by checking
myserver:/home/huber/gobcron$ bin/progress.sh
You may view and/or alter the current default configuration, including gobcron.user.json
myserver:/home/huber/gobcron$ bin/conf.sh -s instance.basedir=/home/huber -a
and eventually play with configurations via -g
and -s
.
You may revisit, which results are available under which tag names, and then create a set of comparison tables between exactly these benchmark run results.
myserver:/home/huber/gobcron$ bin/bigcomparison.sh -l
available tags:
[TAG] ............................................ [DIRECTORY]
tag3 ............................................. ( results/current )
tag2 ............................................. ( results/old.1 )
tag4 ............................................. ( results/old.2 )
tag1 ............................................. ( results/old.3 )
myserver:/home/huber/gobcron$ bin/bigcomparison.sh -t tag1 -t tag2 -t tag3
If you have configured your configration with a valid zulip bot, you may send messages to the zulip id of your choice via bin/zulip.sh [ID] "My message"