Damocles is an Erlang library intended to make writing and running distributed application tests easier. In this first release, it does this by creating local interfaces on a single machine and controlling the flow of packets between those interfaces. This lets you run an entire distributed system on a single machine (currently Linux only) without affecting other applications or traffic, albeit with caps on the load it can handle. Distributed tests can therefore be run easily in a continuous integration environment, without the need to spin up or allocate separate VMs for each application instance.
Damocles requires:
- A Linux system (developed on Mint 17) that:
  - uses 'ip' if available, else 'ifconfig', to add/remove interfaces, and has 'lo' as the local interface
  - has tc and netem
  - has make
  - has sudo permissions for running the above for whatever user you run Damocles as
- Erlang installed and on your path (tested on R17; no R17 features are used in the code, so it can likely run on earlier versions with minimal tweaking, though the specs do rely on some R17 stuff).
If using Damocles from an Erlang application, you can just add it to your test dependencies.
For those wishing to treat Damocles as a command line utility, you'll need to build the code and run it as a release.
Get the code
git clone https://github.com/lostcolony/damocles [location]
Build the code
cd [location]
make
From Erlang, you can start Damocles as an application, or call damocles:start() or damocles:start_link() as appropriate.
For those wishing to use Damocles as a command line app, execute
[location]/scripts/start.sh
Note that this executes the interface ('ip' or 'ifconfig') and tc commands with sudo; if it fails, you may need to run the script itself with sudo.
From Erlang, all commands exist in damocles.erl. A listing with examples is below.
From the command line, you can execute any function using scripts/damocles_external: the first argument is the name of the function from damocles.erl you want to execute, and the remaining arguments are the parameters to pass to it. Lists are expressed as comma-separated strings (see examples).
[location]/scripts/damocles_external add_interface "10.10.10.10"
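To illustrate the list convention, here is a minimal sketch of how a comma-separated argument could be turned into the list of IP strings that the Erlang functions take. The module and function names (`arg_sketch`, `parse_list/1`) are hypothetical, and this is not necessarily how `damocles_external` parses its arguments.

```erlang
-module(arg_sketch).
-export([parse_list/1]).

%% Hypothetical helper, not part of Damocles: splits a comma-separated
%% command line argument into a list of strings.
%% string:tokens/2 is used because it is available on R17.
parse_list(Arg) when is_list(Arg) ->
    string:tokens(Arg, ",").
```

Under this sketch, `arg_sketch:parse_list("10.10.10.10,10.10.10.11")` yields `["10.10.10.10","10.10.10.11"]`, the same shape as the list arguments in the Erlang examples.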
- Rules may only be applied between IPs that have been added/registered with Damocles.
- Both drop and delay rules may be applied separately and will persist until you have restored the node connection.
- Setting a new drop value to a connection will overwrite an existing drop value; same with delay overwriting an existing value.
- All functions that make changes either return ok, return error, or throw. Depending on the function called, error may mean that nothing occurred or, if the function affected multiple connections, that all the connections you referenced in the call have been reset. If an exception is thrown from within Damocles (as opposed to the RPC interface) and the process has restarted (the supervisor is used if Damocles was started as a command line application), all interfaces and such that it knew about have been torn down so that it is in a 'known' state; you will need to recreate/reregister them. If you get anything other than ok or error, call get_known_ips to check whether this has occurred.
- Things can go wrong!
- First, since this requires sudo, you may have to get permissions set up properly.
- If you execute Damocles with sudo (easiest thing for the command line), some log folders get created, which get in the way of running make again. If you need to run make again, sudo rm -rf _rel should set you right.
- There is implicit OS state, and Damocles relies on a clean shutdown rather than clearing interfaces on startup (traffic control is cleared only on initial startup, not on supervisor restarts). If a run ends abruptly (kill -9, a machine restart, or something similar), you may end up with interfaces or traffic control settings left behind. Erlang users can call the damocles_lib:teardown* functions; command line users can run sudo erl -pa ebin from the Damocles folder to start an Erlang shell, and run the teardown commands from there.
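The return-value note above (ok, error, or an exception followed by a check of the known IPs) could be wrapped along these lines. This is only a sketch: `outcome_sketch` and `run/2` are hypothetical names, and the Damocles calls are passed in as funs so the wrapper itself makes no assumptions about Damocles internals.

```erlang
-module(outcome_sketch).
-export([run/2]).

%% Hypothetical wrapper, not part of Damocles.
%% Call is a zero-arity fun wrapping a Damocles call, e.g.
%%   fun() -> damocles:isolate_interface("10.10.10.10") end
%% GetKnownIps is a zero-arity fun, e.g. fun damocles:get_known_ips/0.
run(Call, GetKnownIps) ->
    try Call() of
        ok    -> ok;
        error -> error;
        Other -> {unexpected, Other, GetKnownIps()}
    catch
        Class:Reason ->
            %% Damocles may have restarted and torn everything down;
            %% report which IPs it still knows about.
            {failed, {Class, Reason}, GetKnownIps()}
    end.
```

If the returned IP list is shorter than expected, the interfaces would need to be added or registered again before reapplying rules. Note that the follow-up call to `GetKnownIps` is not itself protected, so it may also fail if Damocles is still restarting.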
From Erlang, stop the application if it was started that way, or call damocles:stop().
From the command line, execute the stop script
[location]/scripts/stop.sh
Use an IP from a reserved range for internal network IPs. These adapters will be torn down when Damocles is stopped.
From Erlang:
damocles:add_interface("10.10.10.10").
damocles:add_interface("10.10.10.11").
damocles:add_interface("10.10.10.12").
damocles:add_interface("10.10.10.13").
damocles:add_interface("10.10.10.14").
From the command line:
[location]/scripts/damocles_external add_interface "10.10.10.10"
[location]/scripts/damocles_external add_interface "10.10.10.11"
[location]/scripts/damocles_external add_interface "10.10.10.12"
[location]/scripts/damocles_external add_interface "10.10.10.13"
[location]/scripts/damocles_external add_interface "10.10.10.14"
Use an IP from an existing local adapter. These adapters will not be torn down when Damocles is stopped, but will have any rules you have applied to them torn down.
From Erlang:
damocles:register_interface("10.10.10.15").
From the command line:
[location]/scripts/damocles_external register_interface "10.10.10.15"
Prevent all traffic flowing out from source to destination, but not traffic flowing the other direction.
From Erlang:
damocles:isolate_one_way("10.10.10.10", "10.10.10.11").
From the command line:
[location]/scripts/damocles_external isolate_one_way 10.10.10.10 10.10.10.11
Will prevent all traffic to and from the specified interface from those other interfaces Damocles knows about (and no others; i.e., it will still be reachable from 127.0.0.1)
From Erlang:
damocles:isolate_interface("10.10.10.10").
From the command line:
[location]/scripts/damocles_external isolate_interface 10.10.10.10
Used to isolate two sets of nodes from each other. Note that any nodes not included in either set retain any pre-existing rules (or lack thereof). That is, if you have nodes running on (prefix).10, .11, .12, .13, and .14, and call this with [.10, .11], and [.13, .14], as per the example below, .10 and .11 can still talk, but neither can reach .13 or .14. Similarly, .13 and .14 can talk, but neither can reach .10 or .11. And .12 can still talk to everyone.
From Erlang:
damocles:isolate_between_interfaces(["10.10.10.10", "10.10.10.11"], ["10.10.10.13", "10.10.10.14"]).
From the command line:
[location]/scripts/damocles_external isolate_between_interfaces "10.10.10.10,10.10.10.11" "10.10.10.13,10.10.10.14"
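To make the partition semantics concrete, the sketch below enumerates which directed connections are cut off between two sets. It is purely illustrative: `partition_sketch` and `blocked_pairs/2` are hypothetical names, and this says nothing about how Damocles implements the rule internally.

```erlang
-module(partition_sketch).
-export([blocked_pairs/2]).

%% Hypothetical helper, not part of Damocles: lists every {Src, Dst}
%% direction that is blocked when SetA and SetB are isolated from
%% each other. Connections within a set are not included.
blocked_pairs(SetA, SetB) ->
    [{A, B} || A <- SetA, B <- SetB] ++
    [{B, A} || A <- SetA, B <- SetB].
```

For the two-by-two example above this gives eight directed pairs, none of which involve "10.10.10.12" or connect two nodes on the same side.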
Similar to preventing traffic between the two (and overwrites it), this causes only a percentage of packets to be dropped between the src IP and dst IP, but not from the dst IP to the src IP. The third argument is the percent chance of a packet being dropped; this can either be a whole integer percentage (20 = 20%), or a float value between 0.0 and 1.0 (0.2 = 20%).
From Erlang:
damocles:packet_loss_one_way("10.10.10.10", "10.10.10.11", 0.05).
From the command line:
[location]/scripts/damocles_external packet_loss_one_way 10.10.10.10 10.10.10.11 .05
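One way to read the two accepted forms of the drop argument is sketched below. The module and function names (`loss_sketch`, `to_fraction/1`) are hypothetical, and the handling of edge cases is an assumption rather than Damocles' actual behaviour.

```erlang
-module(loss_sketch).
-export([to_fraction/1]).

%% Hypothetical helper, not part of Damocles: converts a drop argument
%% to a fraction between 0.0 and 1.0.
%% Assumption: integers are whole percentages, floats are fractions.
to_fraction(P) when is_integer(P), P >= 0, P =< 100 -> P / 100;
to_fraction(P) when is_float(P), P >= 0.0, P =< 1.0 -> P.
```

Under this reading the integer 1 means 1% while the float 1.0 means 100%, so it is worth being explicit about which form you pass.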
Causes a percentage of packets to be dropped for all traffic flowing in or out of this interface. Note that this applies both in and out, so a 10% chance to drop means that a send and acknowledgement will have a 10% chance to fail on the send, -and- a 10% chance to fail on the acknowledgement.
From Erlang:
damocles:packet_loss_interface("10.10.10.10", 0.05).
From the command line:
[location]/scripts/damocles_external packet_loss_interface 10.10.10.10 .05
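Because the drop chance applies in each direction, the chance that a send and its acknowledgement both get through is lower than the single figure suggests. A small sketch of the arithmetic, assuming drops in the two directions are independent (`round_trip_sketch` and `success_probability/1` are hypothetical names):

```erlang
-module(round_trip_sketch).
-export([success_probability/1]).

%% Hypothetical helper, not part of Damocles.
%% Loss is the per-direction drop chance as a fraction (0.1 = 10%).
%% Assumes drops in each direction are independent of each other.
success_probability(Loss) ->
    (1.0 - Loss) * (1.0 - Loss).
```

With a 10% drop chance this comes to roughly 0.81, so about 19% of send/acknowledge exchanges would fail in at least one direction.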
Similar to creating node partitions, this causes a percentage of packets to be dropped for all traffic flowing between a node in the first set and a node in the second set. Note that this applies both in and out, so a 10% chance to drop means that a send and acknowledgement will have a 10% chance to fail on the send, -and- a 10% chance to fail on the acknowledgement.
From Erlang:
damocles:packet_loss_between_interfaces(["10.10.10.10", "10.10.10.11"], ["10.10.10.13", "10.10.10.14"], 0.05).
From the command line:
[location]/scripts/damocles_external packet_loss_between_interfaces "10.10.10.10,10.10.10.11" "10.10.10.13,10.10.10.14" .05
This causes a percentage of packets to be dropped for all traffic flowing between any two nodes that Damocles knows about. Note that this applies both in and out, so a 10% chance to drop means that a send and acknowledgement will have a 10% chance to fail on the send, -and- a 10% chance to fail on the acknowledgement.
From Erlang:
damocles:packet_loss_global(0.05).
From the command line:
[location]/scripts/damocles_external packet_loss_global .05
Similar to preventing traffic between the two, this causes a fixed delay to be imposed on packets between the src IP and the dst IP, and not the reverse. The delay is an integer in milliseconds.
From Erlang:
damocles:delay_one_way("10.10.10.10", "10.10.10.11", 100).
From the command line:
[location]/scripts/damocles_external delay_one_way 10.10.10.10 10.10.10.11 100
Causes all packets to and from the specified IP to be delayed by the specified amount. Note that this applies both in and out, so a 100ms delay will affect both a sent packet, and an acknowledgement, so that things like pings will take 200ms.
From Erlang:
damocles:delay_interface("10.10.10.10", 100).
From the command line:
[location]/scripts/damocles_external delay_interface 10.10.10.10 100
Similar to creating node partitions, this causes a delay for all traffic flowing between a node in the first set and a node in the second set. Note that this applies both in and out, so a 100ms delay means that a send and acknowledgement will have a 100ms delay on the send, -and- a 100ms delay on the acknowledgement, for a total ping time of 200ms.
From Erlang:
damocles:delay_between_interfaces(["10.10.10.10", "10.10.10.11"], ["10.10.10.13", "10.10.10.14"], 100).
From the command line:
[location]/scripts/damocles_external delay_between_interfaces "10.10.10.10,10.10.10.11" "10.10.10.13,10.10.10.14" 100
This causes a delay on all packets flowing between any two nodes that Damocles knows about. Note that this applies both in and out, so a 100ms delay means that a send and acknowledgement will have a 100ms delay on the send, -and- a 100ms delay on the acknowledgement.
From Erlang:
damocles:delay_global(100).
From the command line:
[location]/scripts/damocles_external delay_global 100
Will undo any delay/drop you've imposed on traffic flowing from src to dst (but not the other way).
From Erlang:
damocles:restore_one_way("10.10.10.10", "10.10.10.11").
From the command line:
[location]/scripts/damocles_external restore_one_way 10.10.10.10 10.10.10.11
Will undo any delay/drop you've imposed on the traffic flowing into or out of an interface.
From Erlang:
damocles:restore_interface("10.10.10.10").
From the command line:
[location]/scripts/damocles_external restore_interface 10.10.10.10
Will undo any delay/drop you've imposed on the traffic flowing between interfaces that Damocles knows about.
From Erlang:
damocles:restore_all_interfaces().
From the command line:
[location]/scripts/damocles_external restore_all_interfaces
Returns a list of all IPs Damocles is aware of and can configure.
From Erlang:
damocles:get_known_ips().
From the command line:
[location]/scripts/damocles_external get_known_ips
Returns a proplist of the rules Damocles is applying between a src and dst IP. Note that it only reports the rules applied to packets going from src -> dst; to get the rules for dst -> src, call it again with the arguments in the reverse order.
From Erlang:
damocles:get_rules_for_connection("10.10.10.10", "10.10.10.11").
From the command line:
[location]/scripts/damocles_external get_rules_for_connection 10.10.10.10 10.10.10.11
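Since rules are reported per direction, a small helper can collect both directions in one go. This is a sketch only: `rules_sketch` and `both_ways/3` are hypothetical names, and the lookup is passed in as a fun so nothing is assumed about the contents of the proplist.

```erlang
-module(rules_sketch).
-export([both_ways/3]).

%% Hypothetical helper, not part of Damocles.
%% GetRules is a two-arity fun, e.g.
%%   fun damocles:get_rules_for_connection/2
%% Returns the rules for each direction, keyed by {Src, Dst}.
both_ways(GetRules, IpA, IpB) ->
    [{{IpA, IpB}, GetRules(IpA, IpB)},
     {{IpB, IpA}, GetRules(IpB, IpA)}].
```

From an Erlang shell with Damocles running, this could be called as `rules_sketch:both_ways(fun damocles:get_rules_for_connection/2, "10.10.10.10", "10.10.10.11").`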
# TODO
- Bugfixes. Highest priority.
- Examples of using this library to test distributed code. Second highest priority.
- Allow for the registering of and manipulation of external IPs rather than local ones. Harder, but useful for load testing, so a higher priority when I next have free time.
- Add bandwidth limitations. Possibly some difficulty, but low priority.
- Add additional mechanisms for delaying/dropping in different patterns. Easy but low priority.
- Refactor some things. Very low priority, but a cause of enough joy it might happen.
- Add OSX support. Probably very hard (not sure the domain mapping for traffic control is the same, nor what utilities OSX comes with), moderate priority.
MIT