readme updated, mainly grammar syntax mistakes

2df6f320 · Jakub Janák · b464955e · 2df6f320
Commit 2df6f320 authored 3 months ago by Jakub Janák
--- a/README.md
+++ b/README.md
 # Traveling Salesman Solver

-This is a repository for Traveling Salesman Solver project. 
-This solver is a command line program that uses branch and bound(also parallel version)
-to solve the traveling salesman problem instances exactly.
-The program loads
-the file in a .dot format and shows the solution in the terminal.
-User can also save the 
- solution (there will be an option in the terminal once instance is solved).  
+This repository contains the Traveling Salesman Solver project.  
+The solver is a command-line application that uses a branch-and-bound algorithm (with an optional parallel version) to solve instances of the Traveling Salesman Problem (TSP) exactly.  
+The program loads a `.dot` format file containing the graph, computes the optimal solution, and displays the result in the terminal.  
+Users can also save the solution once the instance is solved; an option will be presented in the terminal upon completion.

 ## The Problem Formulation
-Traveling salesman is a very famous problem in Computer Science.
-Traveling Salesman problem or TSP asks the following:  
-Given a list of cities and the distances between each pair of cities,
-what is the shortest possible route that visits each city
-exactly once and returns to the origin city?  

-It is not particularly hard to define the problem, but it is tough to solve it effectively (maybe even impossible).  
+The Traveling Salesman Problem (TSP) is a very famous problem in Computer Science.  
+The Traveling Salesman Problem (TSP) asks the following:  
+Given a list of cities and the distances between each pair of cities,  
+what is the shortest possible route that visits each city exactly once and returns to the origin city?
+
+It is not particularly hard to define the problem, but it is tough to solve it effectively (maybe even impossible).

 The mathematical definition would be:  
-Given a directed, weighted graph G = (N, E), where:  
-N = {v1, v2, ..., vn} is a set of nodes,  
-E = {e1, e2, ..., en} is a set of edges,
-w: E -> R+ is a weight function, returning the weight of an edge.  
-
-The goal is to find a Hamiltonian cycle H subset of E such that:
-1. each node in N is visited exactly once, and at the end returns to the starting node.  
-2. the total weight of the H is minimal.  
-
-Objective Function:  
-Find a permutation pi of {1, 2, .., n} such that the total cost of the cycle C(pi) is minimized:  
-min C(pi) = sum(i = 1, n) w(a(pi(i), pi(i+1)))  
-where pi(n + 1) is defined as pi(1) to complete the cycle.  
-
-The branch of mathematics that studies this type of optimization problems is called Combinatorial Optimization.
-
-The easiest solution would be to find every permutation and see what the total weight would be.
-Then find the st of hamiltonian 
-cycles with the minimal path.
-The problem here is that the computational complexity of such an solution is O(n!) and therefore, 
-we will very quickly find ourselves in a place where we are unable to find the solution in normal time(in practice the 
-complexity is (n-1)! because we can set some node to be always first).  
-
-The next possibility is to solve the Traveling Salesman by converting it into an LP(linear program).  
-
-Very popular rhythmical is a branch and bound algorithm.
-This algorithm finds the set of hamiltonian paths that 
-are minimal and is considered exact algorithm.
-
-## Approximation 
-The best way to get a good result from a traveling salesman instance is
-not by solving it exactly but by finding path
-that doesn't have to be optimal but is great.
-This method is called approximation and can be done very quickly.
-In real life scenarios
-is used much more that solving the instance exactly as there may be many nodes and edges(e.g., 1000, 10 000).
+Given a directed, weighted graph \( G = (N, E) \), where:  
+\( N = \{v_1, v_2, \dots, v_n\} \) is a set of nodes,  
+\( E = \{e_1, e_2, \dots, e_n\} \) is a set of edges,  
+\( w: E \to \mathbb{R}^+ \) is a weight function, returning the weight of an edge.
+
+The goal is to find a Hamiltonian cycle \( H \subseteq E \) such that:
+1. Each node in \( N \) is visited exactly once, and at the end returns to the starting node.
+2. The total weight of \( H \) is minimal.
+
+### Objective Function:
+Find a permutation \( \pi \) of \( \{1, 2, \dots, n\} \) such that the total cost of the cycle \( C(\pi) \) is minimized:  
+\[
+\min C(\pi) = \sum_{i=1}^{n} w(a(\pi(i), \pi(i+1)))
+\]
+where \( \pi(n + 1) \) is defined as \( \pi(1) \) to complete the cycle.
+
+The branch of mathematics that studies this type of optimization problem is called Combinatorial Optimization.
+
+The easiest solution would be to find every permutation and see what the total weight would be.  
+Then, find the set of Hamiltonian cycles with the minimal path.  
+The problem here is that the computational complexity of such a solution is \( O(n!) \), and therefore,  
+we will very quickly find ourselves in a place where we are unable to find the solution in normal time.
+(In practice, the complexity is \( (n-1)! \) because we can set some node to always be first).
+
+The next possibility is to solve the Traveling Salesman by converting it into a Linear Program (LP).
+
+A very popular algorithm is the Branch and Bound algorithm.  
+This algorithm finds the set of Hamiltonian paths that are minimal and is considered an exact algorithm.
+
+## Approximation
+
+The most efficient way to obtain a good solution for a Traveling Salesman Problem instance is not by solving it exactly, but by finding a path that, while not necessarily optimal, is still very good.  
+This method, known as approximation, can be performed much more quickly than exact methods.  
+In real-life scenarios, approximation is often preferred over exact solutions, especially when dealing with large instances that contain many nodes and edges (e.g., 1,000 or 10,000 nodes).  
 ![Approximate vs Exact Solution](data_analysis/exact_vs_approx.png)

 ## Branch and Bound Algorithm
-Branch and Bound (B&B) is a powerful algorithm for an exact solution of the travelling salesman problem.
-It systematically explores the search space(all possible tours) while eliminating branches(subproblems)
-that cannot lead to a better solution than the current best. The solution time is much better when compared to the brute force approach:
+
+Branch and Bound (B&B) are a powerful algorithms for finding exact solutions to problems in Combinatorial Optimization, particularly Linear Programs.  
+It systematically explores the search space while eliminating branches (subproblems) that cannot lead to a better solution than the current best.  
+We will be using the B&B algorithm for solving the Traveling Salesman Problem (TSP).
+
+The solution time with Branch and Bound is significantly better when compared to the brute-force approach:
+
 ![Brute force vs B&B](data_analysis/brute_force_vs_branch_bound.png)
-However, we still can't be sure that it will be calculated in reasonable time.
+
+However, even with B&B, we cannot always guarantee that the problem will be solved in a reasonable amount of time.
+
+

 ### Steps
-1. Initialisation
-   - Start with a best tour approximation using some approximation algorhythm(Nearest Neighbour + 2Opt) in our case.
-2. Bounding function
-   - We have to specify a bounding function that will calculate the lower bound for every partial tour taken.
-3. Branching
-   - The branching step involves generating new subproblems by exploring the search space tree. 
-4. Bounding
-   - every time the loer bound calculated by the bounding function is higher than the best result yet the subproblem is pruned and not explored further.
-5. Termination
-   - The algorithm terminates when all tours have been either explored or pruned, and the current best solution is the optimal solution for the TSP.
+
+1. **Initialization**
+   - Start with a best tour approximation using an approximation algorithm (such as Nearest Neighbour + 2-Opt, in our case), or infinity.
+
+2. **Bounding Function**
+   - Define a bounding function that calculates the lower bound for every partial tour explored.
+
+3. **Branching**
+   - The branching step generates new subproblems by exploring the search space tree.
+
+4. **Bounding**
+   - Every time the lower bound calculated by the bounding function is higher than the best result so far, the subproblem is pruned and not explored further.
+
+5. **Termination**
+   - The algorithm terminates when all tours have been either explored or pruned, and the current best solution is the optimal solution for the Traveling Salesman Problem (TSP).

 ### Pseudo-code:
 ```plaintext
@@ -81,60 +88,70 @@ However, we still can't be sure that it will be calculated in reasonable time.
   best_solution = infinity (or a heuristic approximation)
   Create an initial node (partial tour with a single city)

-2. For each neighbour of current node:
-   - If the neighbour is already in this subtour then continue
-   - Connect neighbour to the current sub-tour
-   - If bound is <= minimal tour then recursive search()
-   - If length of the tour is same as the number of nodes:
-      - If the cost is less then minimal tour then clear the set of best tours and add there this tour
-      - Else if cost is the same as the minimal cost then add current tour to the set of the best tours
+2. For each neighbour of the current node:
+   - If the neighbour is already in the subtour, continue
+   - Connect the neighbour to the current sub-tour
+   - If the bound is <= minimal tour, call recursive search()
+   - If the length of the tour equals the number of nodes:
+      - If the cost is less than the minimal tour cost, clear the set of best tours and add the current tour
+      - Else if the cost is the same as the minimal cost, add the current tour to the set of best tours

 3. Return best_solution as the optimal tour
 ```

 ### Parallelization
-The Branch and bound can be parallelized, and the performance is indeed growing very well. When using the parallelized version,
-we must make the initial heuristic approximation and cannot start with estimate of infinity. The pseudocode is very similar to the synchronized version, but the change of the best tours
-set needs to be synchronized as well as getting the minimal cost. I am therefore synchronizing with two mutexes. However, the biggest problem is that you need to ensure DFS and not
-accidentally search breath-first. Therefore, in each step in the recursion, the method call is added to thread pool and only for the first neighbor the thread continues in the search.
-We can see that the parallelized version of B&B performs much better than synchronized:
-![Brute force vs B&B](data_analysis/synchronized_vs_parallel_bb.png)  

-The scaling of the algorithm with different number of threads is very good too:
-![Brute force vs B&B](data_analysis/algorithm_speedup_across_thread_counts.png)
+The Branch and Bound (B&B) algorithm can be parallelized, and its performance improves significantly when using multiple threads. 
+In the parallelized version, we must start with an initial heuristic approximation and not an estimate of infinity. 
+The pseudocode is similar to the synchronized version, but there are some differences.
+
+The primary change involves synchronizing access to the best tours set and the minimal cost. 
+To achieve this, I use two mutexes to ensure that updates are thread-safe. 
+However, the biggest challenge is ensuring Depth-First Search (DFS) is used instead of Breadth-First Search (BFS). 
+To address this, the method call is added to the thread pool, but only for the first neighbor does the thread continue its search.
+
+The parallelized version of B&B performs much better than the synchronized version:
+![Brute Force vs B&B](data_analysis/synchronized_vs_parallel_bb.png)
+
+The scaling of the algorithm with varying numbers of threads is also impressive:
+![Brute Force vs B&B](data_analysis/algorithm_speedup_across_thread_counts.png)

 ## Implementation
 ...

-## How to use TSS
+## How to Use TSS

-After compiling the code on your machine you need to file the executable called tss. 
-We will be executing this file. If executed with no arguments or (-h, --help) the help window will show.
+After compiling the code on your machine, you need to find the executable file called `tss`.
+You can execute this file with various arguments. 
+If executed with no arguments or with `-h` or `--help`, a help window will show.

 ```bash
 ./tss --help
 ```

-### Loading instances into program queue
-As a first step, we need to load instances into a queue in the program and then solve them. 
+### Loading Instances into Program Queue
+The first step is to load instances into the program queue and then solve them. 
 There are multiple ways to do this.

 #### Load Instances
 ```bash
 ./tss -l "instance_name.dot"
 ```
-This command will load instance from /files/instances folder.
+This command will load instance from `/files/instances` folder.
+
 #### Auto load Instances
 ```bash
 ./tss -a
 ```
-This command will load all instances stored in the /files/instances folder.
+This command will load all instances stored in the `/files/instances` folder.
+
 #### Create Synthetic Instance
 ```bash
 ./tss -c 10
 ```
-This command will create a synthetic instance, you need to specify how may nodes will the instance have.
-The weight will be assigned randomly with ... The instance will then be loaded into queue.
+This command will create a synthetic instance. You need to specify how many nodes the instance should have. 
+The weights will be assigned randomly. 
+The instance will then be loaded into the queue.

 ### Solving loaded instances

@@ -142,26 +159,29 @@ The weight will be assigned randomly with ... The instance will then be loaded i
 ```bash
 ./tss -s
 ```
-This command will solve all instances in the queue with synchronous branch and bound algorithm.
+This command will solve all instances in the queue using the synchronous branch and bound algorithm.
+
 #### Branch and Bound Parallel solve
 ```bash
 ./tss -p 8
 ```
-This command will solve all instances in the queue with parallel branch and bound algo, 
-you need to specify the number of threads it can use.
+This command will solve all instances in the queue using the parallel branch and bound algorithm. 
+You need to specify the number of threads the program can use.
+
 #### Heuristic approximation
 ```bash
 ./tss -e
 ```
-This command will let you approximate the solution using the Nearest neighbor & 2-Opt heuristic approach. 
-The same method the bb and bb-parallel uses to approximate the solution.
+This command will approximate the solution using the Nearest Neighbor & 2-Opt heuristic approach. 
+This is the same method used by the Branch and Bound and Branch and Bound Parallel algorithms to approximate solutions.

 ### Load and solve
 ```bash
 ./tss load solve
 ```
-You need to load the instances at first and then pick a solving method.
-An example will be:
+You need to load the instances first and then pick a solving method(or approximation).
+
+An example command to load all instances and solve using 10 threads:
 ```bash
 ./tss -a -p 10
 ```
\ No newline at end of file