Add StreetKitchen parser documentation to README

Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>
This commit is contained in:
2026-02-24 16:04:29 +01:00
parent bef4b9978d
commit 480984c7c9
+13
View File
@@ -41,6 +41,7 @@ Docker container for importing recipes from Hungarian websites into [Mealie](htt
| Site | Ingredients | Instructions | Image | Tags | | Site | Ingredients | Instructions | Image | Tags |
|------|:-----------:|:------------:|:-----:|:----:| |------|:-----------:|:------------:|:-----:|:----:|
| mindmegette.hu | Yes | Yes | Yes | Yes | | mindmegette.hu | Yes | Yes | Yes | Yes |
| streetkitchen.hu | Yes (with groups) | Yes (ol/ul/paragraph) | Yes | Yes (from JSON-LD categories) |
| *Other sites* | Fallback (schema.org JSON-LD) | Fallback (schema.org JSON-LD) | Yes (og:image) | Fallback (schema.org keywords) | | *Other sites* | Fallback (schema.org JSON-LD) | Fallback (schema.org JSON-LD) | Yes (og:image) | Fallback (schema.org keywords) |
### Mindmegette.hu Parser ### Mindmegette.hu Parser
@@ -55,6 +56,18 @@ Extracts data from the Angular-rendered HTML:
- **Instructions**: `mindmegette-wysiwyg-box``ol > li` elements - **Instructions**: `mindmegette-wysiwyg-box``ol > li` elements
- **Tags**: `<a class="tag">` elements inside `div.desktop-wrapper` - **Tags**: `<a class="tag">` elements inside `div.desktop-wrapper`
### Streetkitchen.hu Parser
Extracts data from the Next.js-rendered HTML:
- **Title**: `og:title` meta tag, with ` | Street Kitchen` suffix stripped
- **Description**: `og:description` meta tag
- **Image**: `og:image` meta tag (CDN URL)
- **Ingredients**: `div.grid.grid-cols-1` container → `div.my-2.flex` rows; quantity+unit merged in first `<div>` (split via regex), food in `<div class="font-bold">`, optional extra in parenthesised `<div>`
- **Ingredient groups**: `<h5>` headers inside section divs (e.g. "Az előfőzéshez", "A sütéshez")
- **Instructions**: Three formats handled — `<ol>` ordered list, `<ul>` unordered list, or plain `<p>` paragraphs (with optional `<strong>` section headers)
- **Tags**: `recipeCategory` field from JSON-LD `@graph``Recipe` object (comma-separated)
### Generic Fallback Parser ### Generic Fallback Parser
For unsupported sites, attempts extraction via: For unsupported sites, attempts extraction via: