Created
April 17, 2021 17:52
-
-
Save Mahelita/a6a934071f926a944d57ad0c6c99852d to your computer and use it in GitHub Desktop.
Scrape tonie tracks
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
Unfortunately, retrieving a "series" using requests.get() for me doesn't return the same information I'd get via the browser. Example: https://tonies.com/de-de/tonies/?series=anne-kaffeekanne ("tonies.de" gets replaced by "tonies.com/de-de" and "tonies/${series}" becomes "tonies/?series=${series}") - in the browser I get 1 hit while the python code returns some random, and unrelated, stuff :(
Any suggestions what might go wrong here?
Also I'm still trying to find out what's happening in the 4th stage... I'm getting some rather bad matches