Comments

👋 Sign in for the ability to sort posts by relevant, latest, or top.

Vinicius Fagundes

Jun 30

Your warehouse isn't expensive. Your full table scans are.

#bigdata #bigquery #redshift #aws

5 min read

DataDriven

Jun 16

Does Early Dragon Control Help Scaling Compositions? A Big Data Analysis of League of Legends

#bigdata #leagueoflegende #python #karmincorp

4 min read

Fu'ad Husnan

Jun 7

The Future of Query Optimization: AI-Driven Insights in Big Data

#bigdata #database #ai

7 min read

SciForce

Jun 4

Predictive Maintenance in 2026: How AI, Edge Computing, and Agentic Systems Turn Detection Into Action

#ai #manufacturing #bigdata #datascience

14 min read

Lê Đình Phú

Jun 22

How Uber Built Its Big Data System — From a Few TBs to 350 Petabytes with Sub-Hour Latency

#bigdata #dataengineering #apachehudi #architecture

9 min read

Ara

May 17

Migrating a ScyllaDB Cluster the “Brain Transplant” Way

#scylladb #bigdata #database

6 min read

sezin öztekin

May 10

"We Have DevOps, So Why Not DataOps?"

#datascience #bigdata #dataops #devops

2 min read

Ruslan

May 8

Valentina Studio v17.3 supports new VARIANT field type of DuckDB v1.5

#duckdb #valentinastudio #bigdata #database

1 min read

Lê Đình Phú

Jun 8

Why Big Tech is Migrating from Traditional Databases to NewSQL

#bigdata #dataengineering #database #sql

1 min read

Manish Podiyal

May 4

The Data Refinery: Why Apache Spark is the Engine Behind Real-World Big Data Use Cases

#bigdata #spark #pyspark #dataengineering

2 min read

Apache SeaTunnel

May 28

The Next Decade of Data Engineering: From Modern Data Stack to Data Engineering Harness

#data #dataengineering #dataengineeringharness #bigdata

8 min read

Wilians Conde

Apr 16

Processing High Frequency Solar Data Without HPC: Real Constraints and Design Decisions in MackSun

#dataengineering #mongodb #systemdesign #bigdata

3 min read

StiiWann

May 19

Fentanyl Poverty: Building a Big Data Pipeline to Map America's Overdose Epidemic

#bigdata #elasticsearch #spark #python

3 min read

peter muriya

Apr 14

ETL vs ELT: Which One Should You Use and Why?

#dataengineering #etl #elt #bigdata

5 min read

👋 Sign in for the ability to sort posts by relevant, latest, or top.

DEV Community

# bigdata

Your warehouse isn't expensive. Your full table scans are.

Top 12 Spark Interview Problems for Data Engineers, With Answers

Does Early Dragon Control Help Scaling Compositions? A Big Data Analysis of League of Legends

The Future of Query Optimization: AI-Driven Insights in Big Data

Predictive Maintenance in 2026: How AI, Edge Computing, and Agentic Systems Turn Detection Into Action

How Uber Built Its Big Data System — From a Few TBs to 350 Petabytes with Sub-Hour Latency

Migrating a ScyllaDB Cluster the “Brain Transplant” Way

"We Have DevOps, So Why Not DataOps?"

Valentina Studio v17.3 supports new VARIANT field type of DuckDB v1.5

Why Big Tech is Migrating from Traditional Databases to NewSQL

The Data Refinery: Why Apache Spark is the Engine Behind Real-World Big Data Use Cases

The Next Decade of Data Engineering: From Modern Data Stack to Data Engineering Harness

Processing High Frequency Solar Data Without HPC: Real Constraints and Design Decisions in MackSun

Fentanyl Poverty: Building a Big Data Pipeline to Map America's Overdose Epidemic

ETL vs ELT: Which One Should You Use and Why?