v0.8.0: gastrohobbi.hu parser, fix ingredient fraction parsing

Add gastrohobbi.hu parser (WPBakery page builder layout): ingredients
with groups, instructions with embedded lists, tags from JSON-LD
articleSection, prep time extraction.

Fix ingredient line parser: fractions like "1/2" no longer split due to
regex backtracking, en-dash ranges normalized, unicode fractions (½¼¾)
recognized as quantity start across all parsers.

Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>
2026-02-24 19:17:13 +01:00
parent ba5dae2caa
commit 0ec9ce0c6d
3 changed files with 197 additions and 5 deletions
@@ -1,5 +1,17 @@
# Changelog
## v0.8.0 (2026-02-24)
### Added
- GastroHobbi.hu parser: ingredients (with groups), instructions (with embedded lists), tags from JSON-LD articleSection
- Prep time extraction appended to description
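
To illustrate the tag extraction mentioned above, here is a minimal sketch of reading `articleSection` from JSON-LD using only the Python standard library. It is not the parser's actual code: the names `JsonLdCollector`, `iter_nodes`, and `tags_from_article_section` are hypothetical, and both the `@graph` traversal and the splitting of comma-separated strings are assumptions about how such pages are commonly structured.

```python
import json
from html.parser import HTMLParser


class JsonLdCollector(HTMLParser):
    """Collects the raw text of every <script type="application/ld+json"> block."""

    def __init__(self):
        super().__init__()
        self._in_jsonld = False
        self.blocks = []

    def handle_starttag(self, tag, attrs):
        if tag == "script" and dict(attrs).get("type") == "application/ld+json":
            self._in_jsonld = True
            self.blocks.append("")

    def handle_endtag(self, tag):
        if tag == "script":
            self._in_jsonld = False

    def handle_data(self, data):
        if self._in_jsonld:
            self.blocks[-1] += data


def iter_nodes(data):
    """Yields every JSON object in a payload, descending into lists and @graph."""
    if isinstance(data, list):
        for item in data:
            yield from iter_nodes(item)
    elif isinstance(data, dict):
        yield data
        yield from iter_nodes(data.get("@graph", []))


def tags_from_article_section(html):
    """Returns de-duplicated tags, in document order (hypothetical helper)."""
    collector = JsonLdCollector()
    collector.feed(html)
    tags = []
    for block in collector.blocks:
        try:
            data = json.loads(block)
        except json.JSONDecodeError:
            continue  # skip malformed JSON-LD instead of failing the whole parse
        for node in iter_nodes(data):
            section = node.get("articleSection")
            if isinstance(section, str):
                values = [section]
            elif isinstance(section, list):
                values = section
            else:
                continue
            for value in values:
                # Assumption: one string may hold several comma-separated sections.
                for tag in str(value).split(","):
                    tag = tag.strip()
                    if tag and tag not in tags:
                        tags.append(tag)
    return tags
```

Skipping malformed blocks rather than raising keeps one broken script tag from discarding the rest of the recipe.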
### Fixed
- Ingredient line parser: fractions like "1/2" no longer split incorrectly due to regex backtracking
- En-dash ranges in ingredients (e.g. "10–15 dkg") now normalized to "10-15 dkg"
- Unicode fractions (½, ¼, ¾) now recognized as quantity start across all parsers
- Embedded lists in instructions (nested ul>li) no longer produce duplicate entries
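
The quantity fixes listed above can be illustrated with a short sketch. It is not the project's implementation: `RANGE_DASH`, `QUANTITY`, and `split_quantity` are hypothetical names, and the pattern shows only one way to avoid a split "1/2", namely matching the whole quantity as a single token with the fraction alternative tried first.

```python
import re

# Hypothetical names; a sketch of the behaviour described in the changelog.

# En dash (U+2013) or em dash (U+2014) between two digits, optional spaces around it.
RANGE_DASH = re.compile(r"(?<=\d)\s*[\u2013\u2014]\s*(?=\d)")

# The quantity is one token. Fractions are tried before plain integers, so the
# engine never settles on "1" and leaves "/2" in the ingredient text.
QUANTITY = re.compile(
    r"""^\s*(
        (?:\d+\s+)?\d+/\d+                              # "1/2" or "1 1/2"
        | \d+(?:[.,]\d+)?(?:-\d+(?:[.,]\d+)?)?[½¼¾]?    # "2", "1,5", "10-15", "1½"
        | [½¼¾]                                         # a bare unicode fraction
    )\s*(.*)$""",
    re.VERBOSE,
)


def split_quantity(line):
    """Splits an ingredient line into (quantity, rest).

    Returns an empty quantity when the line does not start with one.
    """
    normalized = RANGE_DASH.sub("-", line.strip())
    match = QUANTITY.match(normalized)
    if match is None:
        return "", normalized
    return match.group(1), match.group(2)
```

For example, `split_quantity("1/2 cup sugar")` returns `("1/2", "cup sugar")`, and a line with no leading quantity comes back unchanged with an empty quantity.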
## v0.7.0 (2026-02-24)
### Added