v0.6.1: follow linked recipes on sobors.hu, white favicon
- Sobors.hu parser: detect external links in instructions and follow them to scrape real recipe content (e.g. kiskegyed.hu linked recipes) - Article-style ingredient fallback for sobors.hu pages without structured ingredient containers (h4 + ul > li plain text) - Favicon changed to logo_notext_white.svg Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>
This commit is contained in:
@@ -43,7 +43,7 @@ Docker container for importing recipes from Hungarian websites into [Mealie](htt
|
||||
| mindmegette.hu | Yes | Yes | Yes | Yes |
|
||||
| streetkitchen.hu | Yes (with groups) | Yes (ol/ul/paragraph) | Yes | Yes (from JSON-LD categories) |
|
||||
| nosalty.hu | Yes (with groups) | Yes (with section headers) | Yes | Yes |
|
||||
| sobors.hu | Yes (with groups) | Yes (with section headers) | Yes | Yes |
|
||||
| sobors.hu | Yes (with groups) | Yes (with section headers, follows linked recipes) | Yes | Yes |
|
||||
| *Other sites* | Fallback (schema.org JSON-LD) | Fallback (schema.org JSON-LD) | Yes (og:image) | Fallback (schema.org keywords) |
|
||||
|
||||
### Mindmegette.hu Parser
|
||||
@@ -92,6 +92,8 @@ Extracts data from the sobors.hu recipe pages:
|
||||
- **Ingredients**: `div.hozzavalok-container` → `section` elements with `ul > li`, each containing `span.mennyiseg` (qty), `span.mertekegyseg` (unit), `span.hozzavalo` (food)
|
||||
- **Ingredient groups**: `section > h4` headers (e.g., "A szószhoz:", "A húsgolyókhoz:")
|
||||
- **Instructions**: `div.recept_leiras` → `<p>` tags, with `<h3><strong>` section headers
|
||||
- **Linked recipes**: Some pages link to another site (e.g. kiskegyed.hu) instead of showing full instructions. The parser detects external links in the instruction area and follows them to scrape the real recipe content.
|
||||
- **Article-style ingredient fallback**: Pages without the structured `div.hozzavalok-container` are parsed from article-body `h4` + `ul > li` plain text
|
||||
- **Tags**: `div.cikk-cimkek > ul.cikk-cimkek-list > li > a` (skips generic "Receptek" category)
|
||||
|
||||
### Generic Fallback Parser
|
||||
|
||||
Reference in New Issue
Block a user