Abstract
Tor is a popular anonymity network adopted by more than two million users to preserve their privacy. Tor was mainly developed as a low-latency network to support interactive web browsing and messaging applications. However, bandwidth acquisitive applications such as BitTorrent consume a considerable percentage of Tor traffic. This results in an unfair allocation of the available bandwidth and significant degradation in the Quality-of-service (QoS) delivered to users. This paper presents a QoS-aware deep reinforcement learning approach for Tor's circuit scheduling (QDRL). We propose a design that coalesces the two scheduling levels originally presented in Tor and addresses it as a single resource allocation problem. We use the QoS requirements of different applications to set the weight of active circuits passing through a relay. Furthermore, we propose a set of approaches to achieve the optimal trade-off between system fairness and efficiency. We designed and implemented a reinforcement-learning-based scheduling approach (TRLS), a convex-optimization-based scheduling approach (CVX-OPT), and an average-rate-based proportionally fair heuristic (AR-PF). We also compare the proposed approaches with basic heuristics and with the implemented scheduler in Tor. We show that our reinforcement-learning-based approach (TRLS) achieved the highest QoS-aware fairness level with a resilient performance to the changes in an environment with a dynamic nature, such as the Tor network.
| Original language | English |
|---|---|
| Pages (from-to) | 3396-3410 |
| Number of pages | 15 |
| Journal | IEEE Transactions on Network Science and Engineering |
| Volume | 9 |
| Issue number | 5 |
| DOIs | |
| Publication status | Published - 1 Sept 2022 |
Keywords
- Bandwidth
- Deep reinforcement learning
- Delays
- Optimization
- Quality of service
- Reinforcement learning
- Relays
- Resource management
- Sockets
- Tor