DEV Community

# bigdata

Posts

Your warehouse isn't expensive. Your full table scans are.

5 min read
Top 12 Spark Interview Problems for Data Engineers, With Answers

10 min read
Does Early Dragon Control Help Scaling Compositions? A Big Data Analysis of League of Legends

4 min read
The Future of Query Optimization: AI-Driven Insights in Big Data

7 min read
Predictive Maintenance in 2026: How AI, Edge Computing, and Agentic Systems Turn Detection Into Action

14 min read
How Uber Built Its Big Data System — From a Few TBs to 350 Petabytes with Sub-Hour Latency

9 min read
Migrating a ScyllaDB Cluster the “Brain Transplant” Way

6 min read
"We Have DevOps, So Why Not DataOps?"

2 min read
Valentina Studio v17.3 supports new VARIANT field type of DuckDB v1.5

1 min read
Why Big Tech is Migrating from Traditional Databases to NewSQL

1 min read
The Data Refinery: Why Apache Spark is the Engine Behind Real-World Big Data Use Cases

2 min read
The Next Decade of Data Engineering: From Modern Data Stack to Data Engineering Harness

8 min read
Processing High Frequency Solar Data Without HPC: Real Constraints and Design Decisions in MackSun

3 min read
Fentanyl Poverty: Building a Big Data Pipeline to Map America's Overdose Epidemic

3 min read
ETL vs ELT: Which One Should You Use and Why?

5 min read