EUNO.NEWS EUNO.NEWS
  • All (21455) +34
  • AI (3224) +1
  • DevOps (962) +3
  • Software (11268) +21
  • IT (5948) +8
  • Education (52)
  • Notice
  • All (21455) +34
    • AI (3224) +1
    • DevOps (962) +3
    • Software (11268) +21
    • IT (5948) +8
    • Education (52)
  • Notice
  • All (21455) +34
  • AI (3224) +1
  • DevOps (962) +3
  • Software (11268) +21
  • IT (5948) +8
  • Education (52)
  • Notice
Sources Tags Search
한국어 English 中文
  • 3 weeks ago · software

    Data Engineering Isn’t About Tools — It’s About Thinking Like This

    Introduction Data engineering is often misunderstood as a discipline driven mainly by tools. New learners are frequently advised to master Airflow, Spark, Kafk...

    #data engineering #data pipelines #architecture #Airflow #Spark #Kafka #dbt #cloud platforms #software design #scalable systems
  • 1 month ago · software

    Why Idempotency Is So Important in Data Engineering

    In data engineering, failures are the norm: jobs crash, networks timeout, Airflow retries tasks, Kafka replays messages, and backfills rerun months of data. In...

    #idempotency #data engineering #Airflow #Kafka #Spark #retry logic #data pipelines #distributed systems
  • 1 month ago · software

    GitHub Spark: Improvements, DPA coverage, & dedicated SKU

    Spark is covered by the GitHub Data Protection Agreement As of October 27th, Spark is covered by the GitHub Data Protection Agreement, which means data handlin...

    #GitHub #Spark #Data Protection Agreement #DPA #dedicated SKU #budget controls #preview feature #product improvements
  • 1 month ago · software

    Day 10: Partitioning vs Bucketing - The Spark Optimization Guide Every Data Engineer Needs

    Why Partitioning Matters in Spark Example python df.write.partitionBy'year', 'month'.parquet'/sales' This creates folders such as: year=2024/month=01/ Benefits...

    #spark #partitioning #bucketing #data-engineering #big-data #optimization #parquet #lakehouse
EUNO.NEWS
RSS GitHub © 2026