Files
hsntsn-scraper/README.md
T

933 B

hsntsn-scraper

.NET console scraper.

Source: http://www.hsn-tsn.de/

CSV output fields:

  • HsnTsn, Hsn, Tsn
  • Brand, VehicleType, Model, OfficialType
  • YearFrom, YearTo
  • PowerPs, PowerKw, DisplacementCcm, FuelType
  • MatchKey
  • SourceQuery, SourceListUrl, SourceDetailUrl

Usage

Scrape all brand pages:

dotnet run --project src/HsnTsnScraper/HsnTsnScraper.csproj > hsntsn.csv

Scrape only specific queries from stdin:

printf "0588\nGolf\n" | dotnet run --project src/HsnTsnScraper/HsnTsnScraper.csproj > hsntsn.csv

Enable detail-page enrichment:

printf "0588\n" | dotnet run --project src/HsnTsnScraper/HsnTsnScraper.csproj -- --include-details

Repair only missing year fields from an existing CSV:

dotnet run --project src/HsnTsnScraper/HsnTsnScraper.csproj -- --repair-years --input-csv hsntsn.csv --output-csv hsntsn.repaired.csv