G-WAN is a new free web server. They seem to be very proud of it, or at least just want to make a lot of money. Well anyway, in almost every sentence they write, they claim that they are 20% cooler than anything else. It feels a bit arrogant. I have to admit, I don't know a lot about web servers, so I can't speak to how good they are.
However, then I saw their Captcha example. I also don't know much about machine learning algorithms, OCR, and stuff like that, but I do know how to read pixels. I also know how to compare values with python :P
They say the following about their Captcha:
[...] difficult or even completely impossible for robots.
Wait wat? If this is true, this is something really outstanding and maybe an alternative to reCaptcha...
But then I was like:
So I wrote this basic stupid pixel by pixel reading and comparing code, to decode the captcha.
smrrd$ python crack_captcha.py
GIF Image
---------
R0lGODlhGAAZAJEAAP///9//v4SkZAAAACH5BAEAAAAALAAAAAAYABkAAAJfhI+pGB0rmHuGAmtEPJj7E23VYlmbeDnMB2guu44J2lWqQi/6Drl0k7hlSKwSiHeBgV5BTK2FNOKIsmQVJekIkdzgTEOVIERY4ApDPoczTOvzCbVtq/G6kt4CK+BdRQEAOw==
Captcha Data Matrix
-------------------
1 1 1 1 2 1 1 1 1 1
1 1 2 2 1
1 1 2 2 1
1 1 2 2 1 1 1 1
1 1 2 2 2 2 2 1
1 1 2 1
1 1 1 1 2 1
2 2 2 2 2 1 1 1 1 1 1
2 1 1 1 1
2 1 1
2 2 2 2 1 1 1
2 1 1
2 1 1 1 1
2 1 1 1 1 1 1
1 2 2 2 1
1 1 2 2 1 1
1 1 2 2 1 1
1 2 2 2 2 1 1
1 2 1 1 1 1 1
1 2 2 1
1 2 2 2 1
color | pixel count
-------------------
0 | 472
1 | 81
2 | 44
color 1 | color 2
---------------------
3 | 4
1 | 9
4 |
---------------------
8 | 13
I also don't understand, what they think this means and why they are so excited about it:
The two sums are: 13 and 8... for the same Captcha image!
By just changing the HTML background color [...]
In the end, this was the first time I tried to solve a Captcha. I think this is the best example of how not to implement it.
kind regards,
samuirai
personal Website http://www.smrrd.de
I'm a member of the Stuttgart Hackerspace - shackspace
edit: to see really cool stuff with reCaptcha, check out what they did: http://www.dc949.org/projects/stiltwalker/
Samuirai,
Thanks for discussing your views about G-WAN, a Web server which inspired you, visibly.
The source code of our captcha.c example states:
This text above is also displayed when the example is run by G-WAN (as shown on your screenshot by the way).
Since you did not follow any of the suggested guidelines (nor you tried to implement anything else having similar goals), you did not even try to make it "difficult or even completely impossible for robots" as we suggested this should be done.
This example was merely showing how to exercise the (ultra-fast) G-WAN native support for in-memory GIF images when dealing with transparency. It was called captcha.c because we illustrated how such a mechanism could be done.
But you did not even attempt to do that.
Maybe trying to do it could be a good exercise, worth being published on GitHub.