EUNO.NEWS EUNO.NEWS
  • All (20993) +299
  • AI (3155) +14
  • DevOps (933) +7
  • Software (11054) +203
  • IT (5802) +74
  • Education (48)
  • Notice
  • All (20993) +299
    • AI (3155) +14
    • DevOps (933) +7
    • Software (11054) +203
    • IT (5802) +74
    • Education (48)
  • Notice
  • All (20993) +299
  • AI (3155) +14
  • DevOps (933) +7
  • Software (11054) +203
  • IT (5802) +74
  • Education (48)
  • Notice
Sources Tags Search
한국어 English 中文
  • 1 day ago · software

    Git & GitHub: A Beginner’s Guide to Version Control for Data Professionals

    Installing Git Windows Users 1. Visit the Git for Windows download pagehttps://git-scm.com/download/win. 2. Download the Windows installer. 3. Run the installe...

    #git #github #version control #data engineering #data science #beginner guide
  • 6 days ago · software

    Open sourcing Dicer: Databricks's auto-sharder

    Article URL: https://www.databricks.com/blog/open-sourcing-dicer-databricks-auto-sharder Comments URL: https://news.ycombinator.com/item?id=46606902 Points: 27...

    #databricks #dicer #auto-sharder #open-source #data-sharding #distributed-systems #data-engineering
  • 1 week ago · software

    Stop Re-running Everything: A Local Incremental Pipeline in DuckDB

    Incremental Models + Cached DAG Runs DuckDB‑only I love local‑first data work… until I catch myself doing the same thing for the 12ᵗʰ time: > “I changed one mo...

    #DuckDB #incremental pipelines #data engineering #local development #ETL #cached DAG runs #incremental models #upsert
  • 1 week ago · software

    Building a Government Tender Intelligence System with Python: Lessons from the Real World

    Government Tender Data: A Developer’s Guide Government tendershttps://bidsathi.com/ are one of the largest structured data sources available in India. Every da...

    #python #web-scraping #data-engineering #government-tenders #etl #api-integration #structured-data
  • 2 weeks ago · software

    Part 7: Gold Layer – Metrics, Watermarks, and Aggregations

    !Cover image for Part 7: Gold Layer – Metrics, Watermarks, and Aggregationshttps://media2.dev.to/dynamic/image/width=1000,height=420,fit=cover,gravity=auto,form...

    #gold layer #data engineering #metrics #watermarking #aggregations #event time #dashboard-ready #data pipelines
  • 2 weeks ago · software

    When code-gen suggests deprecated Pandas APIs — a subtle drift that broke a pipeline

    How the failure surfaced The visible fault was a downstream schema mismatch and a failing validation check, not an obvious exception from the generated code. T...

    #pandas #code-generation #deprecated-apis #data-cleaning #pipeline-failure #python #ci-cd #data-engineering
  • 2 weeks ago · software

    When Codegen Suggests Deprecated Pandas APIs — a Cautionary Tale

    How the deprecated API slipped into production I was using a code‑generation model to scaffold a small ETL that normalized CSV files into a canonical DataFrame...

    #code generation #pandas #deprecated APIs #CI/CD #crompt.ai #ETL #data engineering
  • 2 weeks ago · software

    Data Engineering Isn’t About Tools — It’s About Thinking Like This

    Introduction Data engineering is often misunderstood as a discipline driven mainly by tools. New learners are frequently advised to master Airflow, Spark, Kafk...

    #data engineering #data pipelines #architecture #Airflow #Spark #Kafka #dbt #cloud platforms #software design #scalable systems
  • 3 weeks ago · software

    Schema Validation Passed - So Why Did My Pipeline Fail?

    The False Comfort of “Validation Passed” Schema validation does one job really well: it checks if your data file is parseable. json // This passes every schema...

    #schema validation #CI/CD #data pipelines #data engineering #JSON #production bugs #pipeline failures
  • 0 month ago · software

    Data-Architect-Master-Professional-Workbook

    Overview I built a modular, audit‑ready data engineering project and wanted to share it with the community. Features - Clean, production‑style Python - SQL pat...

    #data engineering #ETL #ELT #Python #SQL #data pipelines #open source #GitHub #modular design #debugging
  • 1 month ago · software

    Collecting Real Tourism Listings and Prices at Scale: A Developer’s Guide to Geo-Accurate Data Aggregation

    The Geo‑Context Challenge in Tourism Data Aggregation If you’ve ever tried to aggregate data from global travel platforms—Booking.com, Airbnb, Agoda, Expedia—y...

    #data aggregation #web scraping #geo-location #residential proxies #tourism data #price scraping #data engineering #API design
  • 1 month ago · software

    Building a Reliable Environmental Data Accumulation Pipeline with Python

    !Cover image for Building a Reliable Environmental Data Accumulation Pipeline with Pythonhttps://media2.dev.to/dynamic/image/width=1000,height=420,fit=cover,gra...

    #python #etl #data-pipeline #environmental-data #us-epa #pollution-analysis #data-engineering

Newer posts

Older posts
EUNO.NEWS
RSS GitHub © 2026