← Back to Software
Job Postings
Job postings scraped from xe.gr.
This dataset contains 5,093 job postings collected from xe.gr, one of Greece’s largest employment portals, during January and February 2025. Data was gathered using the R programming language and the rvest web scraping library.
Since version 1.3 a secondary classification table is included, produced by an LLM via the ellmer package, which flags each listing for potential age, gender, appearance, or racial discrimination in the job description text.
Variables
| Column | Type | Description |
|---|---|---|
spc | Qualitative | Occupation / job title |
type | Qualitative | Employment terms (full-time, part-time, …) |
wage | Mixed | Salary or salary range |
loc | Qualitative | Location |
exp | Ordinal | Required work experience |
descr | Qualitative | Full job listing text |
p_date | Qualitative | Publication date |
link | Qualitative | URL of the original listing |
LLM Classification Table
An additional table classifies each listing for discriminatory language:
- Binary flags for age, gender, appearance, and racial discrimination
- A policy flag for listings that violate platform terms
- A reasoning field explaining each classification
Download
| Format | Link |
|---|---|
| CSV | job-posts.csv |
| RDS | job-posts.rds |
| Parquet | job-posts.parquet |
| Feather | job-posts.feather |
| SQLite | job-posts.sqlite |
| CSV (LLM) | llm_classification.csv |