Implements entry splitter, name extractor, field extractor, time normalizer,
schedule line parser, and weekday day-prefix parser. All 26 tests pass.
Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>
- Imports, types, and Prisma client init
- ParsedSchedule and ParsedEntry types for parsing parish data
- ExistingChurch interface for matching
- ImportStats interface for tracking progress
Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>
- Remove discovermassId/buscarmisasNetworkId from findDuplicateChurch match
passes (importers now do their own pre-check dedup); restore as optional
fields on ExistingChurch to keep type/runtime in sync
- Add HK bounding box to COUNTRY_BOUNDING_BOXES; fix silent 0-result
fallback when country query returns empty from mirror server
- discovermass importer: add --limit flag and skip-already-imported
pre-check using importedSlugs set
- Import scripts: remove discovermassId from ExistingChurch select/stubs
(field not needed in shared matcher context)
- Schema: reorder discovermassId/kerknetId/gottesdienstzeitenId fields
Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>
Imports 20,284 US Catholic churches from discovermass.com including mass,
confession, and adoration schedules. Respects robots.txt Crawl-delay: 10.
Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>
Add discovermassId field to ExistingChurch interface and ChurchCandidate type,
insert a dedicated matching pass in findDuplicateChurch, and update all 15 importer
push blocks plus 16 loadExistingChurches select queries to include the new field.
Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>