diff --git a/CHANGELOG.md b/CHANGELOG.md
index 363e713..2504b21 100644
--- a/CHANGELOG.md
+++ b/CHANGELOG.md
@@ -1,5 +1,17 @@
 # Changelog
 
+## v0.8.0 (2026-02-24)
+
+### Added
+- GastroHobbi.hu parser: ingredients (with groups), instructions (with embedded lists), tags from JSON-LD articleSection
+- Prep time extraction appended to description
+
+### Fixed
+- Ingredient line parser: fractions like "1/2" no longer split incorrectly due to regex backtracking
+- En-dash ranges in ingredients (e.g. "10 – 15 dkg") now normalized to "10-15 dkg"
+- Unicode fractions (½, ¼, ¾) now recognized as quantity start across all parsers
+- Embedded lists in instructions (nested ul>li) no longer produce duplicate entries
+
 ## v0.7.0 (2026-02-24)
 
 ### Added
diff --git a/README.md b/README.md
index 2f1fa0b..0a81387 100644
--- a/README.md
+++ b/README.md
@@ -45,6 +45,7 @@ Docker container for importing recipes from Hungarian websites into [Mealie](htt
 | nosalty.hu | Yes (with groups) | Yes (with section headers) | Yes | Yes |
 | sobors.hu | Yes (with groups) | Yes (with section headers, follows linked recipes) | Yes | Yes |
 | kiskegyed.hu | Yes (with groups, dual measurements) | Yes (follows sobors.hu links) | Yes | Yes |
+| gastrohobbi.hu | Yes (with groups) | Yes (with embedded lists) | Yes | Yes (from JSON-LD categories) |
 | *Other sites* | Fallback (schema.org JSON-LD) | Fallback (schema.org JSON-LD) | Yes (og:image) | Fallback (schema.org keywords) |
 
 ### Mindmegette.hu Parser
@@ -111,6 +112,19 @@ Extracts data from kiskegyed.hu recipe pages:
 - **Cross-site links**: Pages linking to sobors.hu are followed to get the full recipe
 - **Tags**: `section.tags > a > span` (# prefix stripped, "recept" filtered)
 
+### GastroHobbi.hu Parser
+
+Extracts data from gastrohobbi.hu recipe pages (WPBakery page builder layout):
+
+- **Title**: `h1.mpcth-post-title > span.mpcth-color-main-border`
+- **Description**: First `

` in the first `wpb_text_column` before the recipe columns; falls back to `og:description`
+- **Image**: `og:image` meta tag
+- **Ingredients**: Finds `h3` containing "Hozzávalók:", then walks sibling `