News
This repository contains a complete pipeline for extracting structured data from Albert Heijn (AH) grocery receipts. It performs PDF OCR, text parsing, and tabular formatting, ultimately producing a ...
Today, at its annual Data + AI Summit, Databricks announced that it is open-sourcing its core declarative ETL framework as Apache Spark Declarative Pipelines, making it available to the entire ...
The sport-activities-features framework handles TCX and GPX activity files and uses Overpass API nodes. Presenting ...