Solved with threads, slower by 20-30% for this workload apparently.
Might be because it is the same solution, juste using a new thread for each test
instead of one thread. Early exit does not seem to improve things, as the threads
complete very quickly.