commit 318035493283aba94719e4a1a0c77e8eb010f305 Author: Stefan Siegl Date: Sun Jul 27 18:21:14 2025 +0200 initial commit. diff --git a/.gitignore b/.gitignore new file mode 100644 index 0000000..ab8c74d --- /dev/null +++ b/.gitignore @@ -0,0 +1,2 @@ +hafensommer-*.json +hafensommer-2025.html diff --git a/README.md b/README.md new file mode 100644 index 0000000..7fba4cb --- /dev/null +++ b/README.md @@ -0,0 +1,41 @@ +# scraper-hafensommer + +minimal HTML to Event JSON scraper for Würzburg Hafensommer + +## Vibe Coding Inspiration :) + +That's the prompt I've used to create it (with current ChatGPT) + + +write a simple website scraper script. first download the page https://www.adticket.de/Hafensommer-Wurzburg.html + +use css selector `.w-paged-listing__list-item` to match a event. +every event shall be stored to a single json file. css select child node with `.c-list-item-event` and pick data attribute `data-sync-id` and pick id from there. save the event json to a file named `hafensommer-{{id}}.json` + +json shall follow schema.org/Event format. + +select child node of `time` element type for `startDate` property (element looks like `