nicknapoli82/cs50_Tideman_cycle-explanation.md

Last active January 23, 2025 18:03

Star (206) You must be signed in to star a gist
Fork (11) You must be signed in to fork a gist

Learn more about clone URLs
Clone this repository at <script src="https://gist.github.com/nicknapoli82/6c5a1706489e70342e9a0a635ae738c9.js"></script>
Save nicknapoli82/6c5a1706489e70342e9a0a635ae738c9 to your computer and use it in GitHub Desktop.

Download ZIP

My attempt at clarifying how cycles work for the Tideman algorithm

Raw

cs50_Tideman_cycle-explanation.md

A Way to Look at Tideman Lock Pairs

I've observed that there is a little bit of a disconnect in understanding what it is that needs to be done to properly implement the lock_pairs function for cs50 Tideman. The goal of this little write-up is simply an attempt at explaining what the problem actually is, and why a cycle imposes a problem.

First:
If you are unfamiliar with the actual problem, or have not read through the entire cs50 Tideman problem description. Then I think you should start there.
cs50 Tideman

Second:
This little write-up is only narrowing in on the idea of cycles, and a way to think about what a cycle is and determine if locking a pair in the pairs array would create that cycle. This does not talk about any other part of the Tideman problem.

lock_pairs Purpose:
The function lock_pairs does one thing, and one thing only. That is update the locked array for all the pairs that should get locked. The lock_pairs function does not know anything about the Tideman problem itself. The function does not know what a source is, it does not care who the eventual winner the Tideman algorithm would pick. It does one thing. It decides whether to lock a pair, or not lock a pair.

I'm saying this explicitly because I see a lot of people make the mistake of inferring the logic the Tideman algorithm would use to take shortcuts in the lock_pairs function. So don't do that. I've also see people conflate the extra ideas that the Tideman algorithm uses, that creates confusion in what it is the lock_pairs function is actually doing.

Once again. lock_pairs doesn't know what a source is, and doesn't care who the eventual winner would be. It is the function that creates the guide (locked) that you can later determine who the winner is.

So what is a cycle?
As explained in the cs50 Tideman problem description. A cycle is is when the winner of a pair can be traced through the graph of what is already locked and get back to that same winner.

Wait... Graph. Maybe that term means something in the realm of computer science.
Wikipedia Graph(abstract data type)

Ok. Don't let that overwhelm you. Just realize that a graph is an idea of how to conceptualize and order data for a problem that can provide a solution in a new way.

So what does a graph have to do with this problem? Lets dig into that with a little bit of a real example.

Consider this:
Candidates = [a, b, c, d]
Sorted Pairs = [(d, a), (a, b), (b, c), (c, a), (d, b), (d, c)]
Note for the Sorted Pairs. If you test and your sorted order is slightly different than this. That is ok based on differences in the sorting algorithm.

We take each pair one pair at a time and determine whether to lock it or not.
When we take a pair, we are in a sense creating a graph, using the locked array as our guide. As more pairs get locked, we have more pairs we may have to check. Our goal is to create a chain, from winner through loser and see if we can get back to our original winner. If we can that is a cycle, and if not then no cycle is created.

So what would that look like?

Taking the first pair (d, a) we take the winner d and see if a wins in any pair that is locked. Since no pairs are locked (this is our first pair) a never wins. So we can lock (d, a). The chain formed would look as follows.
d -> a

Taking the second pair (a, b) we perform the same algorithm. b is never found to win in any locked pair though. So we can safely lock.
a -> b

Taking the third pair (b, c) we do the same once again. c is never found to win in any locked pair though. So we can safely lock.
b -> c

Taking the fourth pair (c, a). a is found to win against another candidate and that pair is locked. So we build a chain starting with c, though a, and see how far it takes us.
c -> a -> b -> c
In this case we find that using the previously locked pairs, we can create a chain from c and get back to c. So we shouldn't lock this pair. It is a cycle.

Taking the fifth pair (d, b) we once again do the same.
d -> b -> c
Because the pair (c, a) was not locked prior to checking our pair (d, b) we can safely lock this pair.

Taking the final pair (d, c) we do the same one more time.
d -> c
And we can lock the final pair.

So considering the above we locked all but one pair. Pair (c, a) was found to create a cycle.

Our final graph would look something like this

		a
	➚	↓
d	➙	b
	➘	↓
		c

Takeaway
The primary thing I hope you observe in the above is the simple logic we can apply to determine what forms a cycle, and what does not. Every time we find a new link in the chain, we have to re-examine all the pairs once again to see if the newly found loser would win against any other candidate in any locked pair. If we find one, we create a new link in the chain and repeat the process.

For every pair we consider. We are creating a new graph and attempting to traverse from the original winner, back to that same original winner. Even though the ultimate result is one big graph.

Please note. This is a very simple example, and it is possible that you have to check more than one chain when looking for a cycle. For example if a candidate wins agains multiple other candidates, but also loses.

Why recursion is helpful
The reason recursion can help in solving this type of problem is based on how each recursive function remembers where it left off if you continue to call it over an over. I do believe this cs50 short video explains how that memory works pretty well. So every time you find you should create a new link in the chain, you can recursively call your function to look for a cycle and all the frames remember where they left off.

I hope my couple thoughts are helpful in aiding you in solving the Tideman problem. Feel free to let me know if you see any error in my above explanation of things, or if you feel I could/should add anything I didn't touch on.

Feel free to use the text4.txt file to test your own solution to the Tideman problem.
To use the file just download to your folder where you tideman program exists and run as
./tideman a b c d < test4.txt

Raw

	5
	a
	c
	b
	d
	b
	c
	d
	a
	d
	c
	a
	b
	d
	a
	b
	c
	d
	b
	c
	a

AlexMartosP commented Nov 17, 2023

Thank you so much for the clarification, helped a lot!

ani0525 commented Dec 26, 2023

Thank you! This explanation is golden.

Sutasu-sama commented Jan 22, 2024

You helped me a lot mate. Cheers!

diegomendesmoreno commented Feb 16, 2024 •

edited

Loading

Taking the fourth pair (c, a). a is found to win against another candidate and that pair is locked. So we build a chain starting with c, though a, and see how far it takes us.
c -> a -> b -> c
In this case we find that using the previously locked pairs, we can create a chain from c and get back to c. So we shouldn't lock this pair. It is a cycle.

Thank you for this... This quoted part and the Tideman Testing tool finally allowed me to finish this problem.

fakeboxman commented May 15, 2024

Thank you.

khlawa commented Jul 12, 2024

Thanks!

antatj commented Jul 31, 2024

The main issue is the incorrectly defined request for the sort_pairs function:

"Complete the sort_pairs function. The function should sort the pairs array in decreasing order of strength of victory, where strength of victory is defined to be the number of voters who prefer the preferred candidate. If multiple pairs have the same strength of victory, you may assume that the order does not matter."

According to the last sentence in the request, if we have multiple pairs with the same strength of victory, the order does not matter. However, the order is actually quite important!

Consider the following test case where most programs fail:

Vote 1: A > B > C
Vote 2: B > C > A
Vote 3: C > A > B
In this case, the pairs would be: (A, B), (B, C), and (C, A). But they could also be BC AB CA or CA AB BC... Depending on how we sort these pairs, we get different results! Try it.

The solution lies in modifying the request for the sort_pairs function. If we have multiple pairs with the same strength of victory, they should be sorted according to the order of voting. In this case, (A, B), (B, C), and (C, A). The first vote was A B C, so the first pair is AB; the second vote was B C A, so the second pair is BC, and so on. Otherwise, the program fails the test.

(lock_pairs skips final pair if it creates a cycle
Cause: lock_pairs did not correctly lock all non-cyclical pairs)

(lock_pairs skips middle pair if it creates a cycle
Cause: lock_pairs did not correctly lock all non-cyclical pairs)

I apologize for the length and if the text is a bit hastily written, but I spent the whole day trying to solve a problem caused by an incorrect request (sort function), and I'm a bit tired of it all. :)

BoburTursunov commented Sep 9, 2024

How does Alice win in this case?)))

Zof90 commented Jan 23, 2025

Merci beaucoup ! Cela m'aide vraiment à comprendre l'exigence et à enfin résoudre le problème !

Je ne comprends vraiment pas comment tout le monde est si instantanément éclairé après avoir lu ce qui précède. Peut-être que je suis juste stupide, mais je continue à tourner en rond depuis une semaine maintenant, et je ne suis pas plus près de résoudre sort_pairs.

Quelqu'un peut-il au moins confirmer si cette tâche peut être réalisée dans lock_pairs, ou nécessite-t-elle effectivement une nouvelle fonction pour faciliter la récursivité, comme le suggèrent la plupart des gens ?

If you're as stupid as you say, then we're two. Lol, stay strong!

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment