← Vissza a listához

Futtatás részletei

https://bodrogdental.hu/

Azonosítók

Státusz
failed
Task ID
a9d4b60b-8d07-4079-a68b-6a3aae227ac3
State fájl
20260320_105402_bodrogdental_hu.json
Aktuális stage
taxonomy_enrichment_alt
Létrehozva
2026-03-20 10:54:02
Frissítve
2026-03-20 10:58:08
Futás időtartama
4 min 5 s

Stage-ek

A séma az aktuális config alapján rajzolódik; egy régebbi futtatás más orchestrátorral készülhetett. BPMN-szerű jelölés: zöld kör = kezdőesemény, kék + rombusz = párhuzamos fork/join, lekerekített téglalap = task (szín = státusz), dupla piros kör = befejezés. Egér a task fölött: részletek (státusz, időpontok, hiba).

Stage Státusz Indult Befejezve Időtartam Hiba
data_extraction failed 2026-03-20 10:58:08 Async discovery: no BERT candidate URL produced valid content. Tried 0 URL(s).
discovery_fetch_validation failed 2026-03-20 10:54:02 2026-03-20 10:58:08 4 min 5 s Async discovery: no BERT candidate URL produced valid content. Tried 0 URL(s).
metadata_alt completed 2026-03-20 10:54:02 2026-03-20 10:58:08 4 min 5 s
reviews completed 2026-03-20 10:55:05 2026-03-20 10:58:08 3 min 2 s
taxonomy_enrichment_alt failed 2026-03-20 10:58:08 Async discovery: no BERT candidate URL produced valid content. Tried 0 URL(s).

A futtatás hibával zárult — részletek a stage táblázatban.

Nyers state (JSON)
{
  "execution_id": "a9d4b60b-8d07-4079-a68b-6a3aae227ac3",
  "input_url": "https://bodrogdental.hu/",
  "state_filename": "20260320_105402_bodrogdental_hu.json",
  "created_at": "2026-03-20T10:54:02.761222",
  "updated_at": "2026-03-20T10:58:08.197627",
  "stages": {
    "metadata_alt": {
      "stage_name": "metadata_alt",
      "status": "completed",
      "started_at": "2026-03-20T10:54:02.892965",
      "completed_at": "2026-03-20T10:58:08.102062",
      "result": {
        "metadata": {
          "company_name": "Unknown",
          "website": "https://bodrogdental.hu/",
          "error": "Failed to extract content from any page"
        }
      },
      "error": null,
      "metadata": {}
    },
    "discovery_fetch_validation": {
      "stage_name": "discovery_fetch_validation",
      "status": "failed",
      "started_at": "2026-03-20T10:54:02.914512",
      "completed_at": "2026-03-20T10:58:08.151382",
      "result": null,
      "error": "Async discovery: no BERT candidate URL produced valid content. Tried 0 URL(s).",
      "metadata": {}
    },
    "reviews": {
      "stage_name": "reviews",
      "status": "completed",
      "started_at": "2026-03-20T10:55:05.710225",
      "completed_at": "2026-03-20T10:58:08.124999",
      "result": {
        "reviews": {
          "company_name": "Unknown",
          "total_reviews": 9,
          "average_rating": 4.8,
          "reviews": [
            {
              "author": "Natália Szász-Román",
              "rating": 3,
              "text": "Jo kirándulohely",
              "date": null
            }
          ],
          "source": "google-maps-scraper",
          "postal_code": "",
          "city": "",
          "street": "",
          "phone": ""
        }
      },
      "error": null,
      "metadata": {}
    },
    "data_extraction": {
      "stage_name": "data_extraction",
      "status": "failed",
      "started_at": null,
      "completed_at": "2026-03-20T10:58:08.174701",
      "result": null,
      "error": "Async discovery: no BERT candidate URL produced valid content. Tried 0 URL(s).",
      "metadata": {}
    },
    "taxonomy_enrichment_alt": {
      "stage_name": "taxonomy_enrichment_alt",
      "status": "failed",
      "started_at": null,
      "completed_at": "2026-03-20T10:58:08.197614",
      "result": null,
      "error": "Async discovery: no BERT candidate URL produced valid content. Tried 0 URL(s).",
      "metadata": {}
    }
  },
  "overall_status": "failed",
  "current_stage": "taxonomy_enrichment_alt",
  "resume_from_stage": null,
  "llm_usage_summary": null
}

Futtatás naplók

Forrás: data/logs — név szerint illeszkedő .log fájlok (API/orchestrator: a9d4b60b-8d07-4079-a68b-6a3aae227ac3_*.log, CLI: pipeline_a9d4b60b_*.log).

a9d4b60b-8d07-4079-a68b-6a3aae227ac3_20260320_105402.log

data/logs/a9d4b60b-8d07-4079-a68b-6a3aae227ac3_20260320_105402.log

2026-03-20 10:54:02 | INFO     | Starting discovery-fetch-validation (async) for URL: https://bodrogdental.hu/
2026-03-20 10:54:02 | INFO     | Async discovery config: fetch=curl, output=html, prediction=http://docker-host:8000/predict
2026-03-20 10:54:02 | INFO     | Async crawl starting: https://bodrogdental.hu/ (max_depth=2, max_concurrent=10)
2026-03-20 10:54:04 | INFO     | Crawl finished: 1 URLs in 1.1s (success=0, errors=1)
2026-03-20 10:54:04 | INFO     | Crawl produced 0 URLs from BERT (threshold and above), fetching all
2026-03-20 10:54:04 | ERROR    | Async discovery: no BERT candidate URL produced valid content. Tried 0 URL(s).
2026-03-20 10:54:04 | INFO     | Attempting fallback: original URL with trafilatura+markdown