From 480984c7c9f12b089633b235e101228374bc44bf Mon Sep 17 00:00:00 2001 From: kisfenyo Date: Tue, 24 Feb 2026 16:04:29 +0100 Subject: [PATCH] Add StreetKitchen parser documentation to README Co-Authored-By: Claude Opus 4.6 --- README.md | 13 +++++++++++++ 1 file changed, 13 insertions(+) diff --git a/README.md b/README.md index 9798b2d..82c1f90 100644 --- a/README.md +++ b/README.md @@ -41,6 +41,7 @@ Docker container for importing recipes from Hungarian websites into [Mealie](htt | Site | Ingredients | Instructions | Image | Tags | |------|:-----------:|:------------:|:-----:|:----:| | mindmegette.hu | Yes | Yes | Yes | Yes | +| streetkitchen.hu | Yes (with groups) | Yes (ol/ul/paragraph) | Yes | Yes (from JSON-LD categories) | | *Other sites* | Fallback (schema.org JSON-LD) | Fallback (schema.org JSON-LD) | Yes (og:image) | Fallback (schema.org keywords) | ### Mindmegette.hu Parser @@ -55,6 +56,18 @@ Extracts data from the Angular-rendered HTML: - **Instructions**: `mindmegette-wysiwyg-box` → `ol > li` elements - **Tags**: `` elements inside `div.desktop-wrapper` +### Streetkitchen.hu Parser + +Extracts data from the Next.js-rendered HTML: + +- **Title**: `og:title` meta tag, with ` | Street Kitchen` suffix stripped +- **Description**: `og:description` meta tag +- **Image**: `og:image` meta tag (CDN URL) +- **Ingredients**: `div.grid.grid-cols-1` container → `div.my-2.flex` rows; quantity+unit merged in first `
` (split via regex), food in `
`, optional extra in parenthesised `
` +- **Ingredient groups**: `
` headers inside section divs (e.g. "Az előfőzéshez", "A sütéshez") +- **Instructions**: Three formats handled — `
    ` ordered list, `
      ` unordered list, or plain `

      ` paragraphs (with optional `` section headers) +- **Tags**: `recipeCategory` field from JSON-LD `@graph` → `Recipe` object (comma-separated) + ### Generic Fallback Parser For unsupported sites, attempts extraction via: