blocking_semaphore

A simple and performant blocking semaphore.
todo list:
- binary semaphore
- counting semaphore
- portable stdlib implementation with Condvar
- optimize Windows performance with WaitOnAddress
- optimize Linux performance with futex
- optimize macOS performance with dispatch_semaphore