The diagram is drawn from the current config; an older run may have been produced with a different orchestrator. BPMN-style notation: green circle = start event, blue diamond with a + = parallel fork/join, rounded rectangle = task (color = status), double red circle = end event. Hover over a task for details (status, timestamps, error).
| Stage | Status | Started | Finished | Duration | Error |
|---|---|---|---|---|---|
| discovery_fetch_validation | failed | 2026-03-26 12:37:19 | 2026-03-26 12:37:20 | 0.9 s | Async discovery: no BERT candidate URL produced valid content. Tried 0 URL(s). |
| metadata_alt | completed | 2026-03-26 12:37:19 | 2026-03-26 12:37:50 | 30 s | — |
| reviews | running | 2026-03-26 12:37:50 | — | 35 d 9 h (so far) | — |
The run finished with an error; see the stage table for details.
{
  "execution_id": "4cfc4d83-6095-4e7f-bdd2-68be5e022550",
  "input_url": "https://centro-medical.hu/",
  "state_filename": "20260326_123718_centro-medical_hu.json",
  "created_at": "2026-03-26T12:37:18.365131",
  "updated_at": "2026-03-31T06:55:40.651256",
  "stages": {
    "metadata_alt": {
      "stage_name": "metadata_alt",
      "status": "completed",
      "started_at": "2026-03-26T12:37:19.136071",
      "completed_at": "2026-03-26T12:37:50.119118",
      "result": {
        "metadata": {
          "company_name": "Unknown",
          "website": "https://centro-medical.hu/",
          "error": "Failed to extract content from any page"
        }
      },
      "error": null,
      "metadata": {}
    },
    "discovery_fetch_validation": {
      "stage_name": "discovery_fetch_validation",
      "status": "failed",
      "started_at": "2026-03-26T12:37:19.306881",
      "completed_at": "2026-03-26T12:37:20.176942",
      "result": null,
      "error": "Async discovery: no BERT candidate URL produced valid content. Tried 0 URL(s).",
      "metadata": {}
    },
    "reviews": {
      "stage_name": "reviews",
      "status": "running",
      "started_at": "2026-03-26T12:37:50.362830",
      "completed_at": null,
      "result": null,
      "error": null,
      "metadata": {}
    }
  },
  "overall_status": "failed",
  "current_stage": "reviews",
  "resume_from_stage": null,
  "llm_usage_summary": null
}
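The duration column of the stage table can be derived from the `started_at` and `completed_at` fields in the state above. The following is a minimal sketch, not the dashboard's actual code: the function names `format_duration` and `stage_duration` are hypothetical, and the rounding rules (one decimal below 10 s, whole seconds truncated otherwise, days and hours for long spans) are assumptions chosen to reproduce the values shown in the table. A stage with no `completed_at` is measured against a caller-supplied "now".

```python
from datetime import datetime
from typing import Optional


def format_duration(seconds: float) -> str:
    """Render a duration in a compact form (assumed rules, see lead-in)."""
    if seconds < 10:
        return f"{seconds:.1f} s"
    if seconds < 60:
        return f"{int(seconds)} s"
    minutes = int(seconds) // 60
    hours, minutes = divmod(minutes, 60)
    days, hours = divmod(hours, 24)
    if days:
        return f"{days} d {hours} h"
    if hours:
        return f"{hours} h {minutes} min"
    return f"{minutes} min"


def stage_duration(started_at: str, completed_at: Optional[str], now: datetime) -> str:
    """Duration of a stage; an unfinished stage is measured up to `now`."""
    start = datetime.fromisoformat(started_at)
    end = datetime.fromisoformat(completed_at) if completed_at else now
    text = format_duration((end - start).total_seconds())
    return text if completed_at else f"{text} (so far)"


# Timestamps taken from the state file; `now` is an arbitrary example value.
now = datetime(2026, 4, 30, 22, 0, 0)
print(stage_duration("2026-03-26T12:37:19.306881", "2026-03-26T12:37:20.176942", now))  # 0.9 s
print(stage_duration("2026-03-26T12:37:19.136071", "2026-03-26T12:37:50.119118", now))  # 30 s
print(stage_duration("2026-03-26T12:37:50.362830", None, now))  # 35 d 9 h (so far)
```

Note that `reviews` is still marked `running` while `overall_status` is `failed`, so its duration keeps growing with the viewing time rather than reflecting real work.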
Source: data/logs, .log files matched by name (API/orchestrator: 4cfc4d83-6095-4e7f-bdd2-68be5e022550_*.log, CLI: pipeline_4cfc4d83_*.log).
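The name-based matching described above can be sketched with glob patterns. This is an illustration under assumptions: the helper names `log_patterns` and `matching_logs` are hypothetical, and the CLI prefix is assumed to be the first dash-separated group of the execution ID (which is consistent with `4cfc4d83` here but is not confirmed by the page).

```python
from fnmatch import fnmatchcase
from typing import Iterable, List


def log_patterns(execution_id: str) -> List[str]:
    """Glob patterns for API/orchestrator and CLI log files of one run."""
    short_id = execution_id.split("-")[0]  # assumption: CLI logs use the first ID group
    return [f"{execution_id}_*.log", f"pipeline_{short_id}_*.log"]


def matching_logs(filenames: Iterable[str], execution_id: str) -> List[str]:
    """Keep only the file names that match one of the run's patterns."""
    patterns = log_patterns(execution_id)
    return [name for name in filenames
            if any(fnmatchcase(name, pattern) for pattern in patterns)]


names = [
    "4cfc4d83-6095-4e7f-bdd2-68be5e022550_20260326_123719.log",
    "pipeline_4cfc4d83_example.log",  # hypothetical CLI log name
    "unrelated.log",
]
print(matching_logs(names, "4cfc4d83-6095-4e7f-bdd2-68be5e022550"))
```

In practice the file names would come from a directory listing of `data/logs`, for example via `os.listdir`.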
data/logs/4cfc4d83-6095-4e7f-bdd2-68be5e022550_20260326_123719.log
2026-03-26 12:37:19 | INFO | Starting discovery-fetch-validation (async) for URL: https://centro-medical.hu/
2026-03-26 12:37:19 | INFO | Async discovery config: fetch=curl, output=html, prediction=http://docker-host:8000/predict
2026-03-26 12:37:19 | INFO | Async crawl starting: https://centro-medical.hu/ (max_depth=2, max_concurrent=10)
2026-03-26 12:37:19 | INFO | Crawl finished: 1 URLs in 0.1s (success=0, errors=1)
2026-03-26 12:37:19 | INFO | Crawl produced 0 URLs from BERT (threshold and above), fetching all
2026-03-26 12:37:19 | ERROR | Async discovery: no BERT candidate URL produced valid content. Tried 0 URL(s).
2026-03-26 12:37:19 | INFO | Attempting fallback: original URL with trafilatura+markdown
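Log entries in this `timestamp | LEVEL | message` layout can be split into structured records even when several entries run together on one line. The sketch below is an assumption about how one might post-process the log, not part of the pipeline; `parse_log` is a hypothetical helper that splits on whitespace preceding a timestamp.

```python
import re
from typing import List, Tuple

# Whitespace that is immediately followed by "YYYY-MM-DD HH:MM:SS | " starts a new entry.
ENTRY_BOUNDARY = re.compile(r"\s+(?=\d{4}-\d{2}-\d{2} \d{2}:\d{2}:\d{2} \| )")


def parse_log(text: str) -> List[Tuple[str, str, str]]:
    """Return (timestamp, level, message) tuples from raw log text."""
    entries = []
    for chunk in ENTRY_BOUNDARY.split(text.strip()):
        parts = chunk.split(" | ", 2)  # message may itself contain " | "
        if len(parts) == 3:
            entries.append((parts[0], parts[1], parts[2]))
    return entries


sample = (
    "2026-03-26 12:37:19 | INFO | Crawl finished: 1 URLs in 0.1s (success=0, errors=1) "
    "2026-03-26 12:37:19 | ERROR | Async discovery: no BERT candidate URL produced valid content. Tried 0 URL(s). "
    "2026-03-26 12:37:19 | INFO | Attempting fallback: original URL with trafilatura+markdown"
)
errors = [message for _, level, message in parse_log(sample) if level == "ERROR"]
print(errors)
```

Filtering on the level field, as in the last lines, surfaces the single ERROR entry that matches the `error` value recorded for `discovery_fetch_validation`.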