Add StreetKitchen parser documentation to README
Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>
This commit is contained in:
@@ -41,6 +41,7 @@ Docker container for importing recipes from Hungarian websites into [Mealie](htt
|
||||
| Site | Ingredients | Instructions | Image | Tags |
|
||||
|------|:-----------:|:------------:|:-----:|:----:|
|
||||
| mindmegette.hu | Yes | Yes | Yes | Yes |
|
||||
| streetkitchen.hu | Yes (with groups) | Yes (ol/ul/paragraph) | Yes | Yes (from JSON-LD categories) |
|
||||
| *Other sites* | Fallback (schema.org JSON-LD) | Fallback (schema.org JSON-LD) | Yes (og:image) | Fallback (schema.org keywords) |
|
||||
|
||||
### Mindmegette.hu Parser
|
||||
@@ -55,6 +56,18 @@ Extracts data from the Angular-rendered HTML:
|
||||
- **Instructions**: `mindmegette-wysiwyg-box` → `ol > li` elements
|
||||
- **Tags**: `<a class="tag">` elements inside `div.desktop-wrapper`
|
||||
|
||||
### Streetkitchen.hu Parser
|
||||
|
||||
Extracts data from the Next.js-rendered HTML:
|
||||
|
||||
- **Title**: `og:title` meta tag, with ` | Street Kitchen` suffix stripped
|
||||
- **Description**: `og:description` meta tag
|
||||
- **Image**: `og:image` meta tag (CDN URL)
|
||||
- **Ingredients**: `div.grid.grid-cols-1` container → `div.my-2.flex` rows; quantity+unit merged in first `<div>` (split via regex), food in `<div class="font-bold">`, optional extra in parenthesised `<div>`
|
||||
- **Ingredient groups**: `<h5>` headers inside section divs (e.g. "Az előfőzéshez", "A sütéshez")
|
||||
- **Instructions**: Three formats handled — `<ol>` ordered list, `<ul>` unordered list, or plain `<p>` paragraphs (with optional `<strong>` section headers)
|
||||
- **Tags**: `recipeCategory` field from JSON-LD `@graph` → `Recipe` object (comma-separated)
|
||||
|
||||
### Generic Fallback Parser
|
||||
|
||||
For unsupported sites, attempts extraction via:
|
||||
|
||||
Reference in New Issue
Block a user