Skip to content
Navigation menu
Search
Powered by Algolia
Search
Log in
Create account
DEV Community
Close
#
dataengineering
Follow
Hide
Posts
Left menu
đź‘‹
Sign in
for the ability to sort posts by
relevant
,
latest
, or
top
.
Right menu
When Synthetic Data Lies: A Hidden Correlation Problem I Didn’t Expect
Mohamed Hussain S
Mohamed Hussain S
Mohamed Hussain S
Follow
Mar 26
When Synthetic Data Lies: A Hidden Correlation Problem I Didn’t Expect
#
dataengineering
#
clickhouse
#
analytics
#
debugging
3
 reactions
Comments
Add Comment
3 min read
Why Data Governance Is Not Optional in a Microsoft Fabric Workflow
PrachiBhende
PrachiBhende
PrachiBhende
Follow
Mar 30
Why Data Governance Is Not Optional in a Microsoft Fabric Workflow
#
architecture
#
dataengineering
#
microsoft
#
security
1
 reaction
Comments
Add Comment
6 min read
Issues of Multi-GB Spreadsheets in Data Lakes
Toby Patrick
Toby Patrick
Toby Patrick
Follow
Mar 26
Issues of Multi-GB Spreadsheets in Data Lakes
#
data
#
dataengineering
#
performance
Comments
Add Comment
4 min read
Asset-Based Data Orchestration: Lessons from Building a Multi-State Social Data Platform
uninterrupted
uninterrupted
uninterrupted
Follow
for
u11d
Mar 26
Asset-Based Data Orchestration: Lessons from Building a Multi-State Social Data Platform
#
dagster
#
dataorchestration
#
dataengineering
1
 reaction
Comments
Add Comment
6 min read
How to Build a Scalable Serverless Social Media Ingestion & Analytics Pipeline on AWS
CapeStart
CapeStart
CapeStart
Follow
Mar 26
How to Build a Scalable Serverless Social Media Ingestion & Analytics Pipeline on AWS
#
aws
#
serverless
#
dataengineering
1
 reaction
Comments
Add Comment
4 min read
Bypassing the "Pandas RAM Tax": Building a Zero-Copy CSV Extractor in C
NARESH-CN2
NARESH-CN2
NARESH-CN2
Follow
Apr 14
Bypassing the "Pandas RAM Tax": Building a Zero-Copy CSV Extractor in C
#
c
#
python
#
dataengineering
#
performance
1
 reaction
Comments
1
 comment
2 min read
Break Free from Wearable Silos: Building a Universal Health Data ETL with Health Connect & FHIR
Beck_Moulton
Beck_Moulton
Beck_Moulton
Follow
Mar 25
Break Free from Wearable Silos: Building a Universal Health Data ETL with Health Connect & FHIR
#
android
#
dataengineering
#
healthtech
#
fhir
2
 reactions
Comments
Add Comment
3 min read
The TLS Fingerprinting Hell: Why I Stopped Reverse-Engineering the Vinted App
KazKN
KazKN
KazKN
Follow
Mar 25
The TLS Fingerprinting Hell: Why I Stopped Reverse-Engineering the Vinted App
#
webdev
#
python
#
dataengineering
#
security
Comments
Add Comment
5 min read
Stateful AI: Streaming Long-Term Agent Memory with Amazon Kinesis
Jubin Soni
Jubin Soni
Jubin Soni
Follow
Mar 25
Stateful AI: Streaming Long-Term Agent Memory with Amazon Kinesis
#
aws
#
generativeai
#
dataengineering
#
amazonkinesis
2
 reactions
Comments
Add Comment
6 min read
Cara Menggunakan API EMR: Panduan Lengkap
Walse
Walse
Walse
Follow
Mar 24
Cara Menggunakan API EMR: Panduan Lengkap
#
api
#
aws
#
dataengineering
#
tutorial
Comments
Add Comment
7 min read
LINUX AS THE NERVOUS SYSTEM OF DATA ENGINEERING
Collins Njeru
Collins Njeru
Collins Njeru
Follow
Mar 28
LINUX AS THE NERVOUS SYSTEM OF DATA ENGINEERING
#
dataengineering
#
luxdevhq
#
harunmbaabu
#
deeplearning
3
 reactions
Comments
Add Comment
3 min read
Automating Clinical Data Analysis: The Pipeline From Hospital Exports to Paper Drafts
local ai
local ai
local ai
Follow
Mar 28
Automating Clinical Data Analysis: The Pipeline From Hospital Exports to Paper Drafts
#
automation
#
dataengineering
#
datascience
#
writing
1
 reaction
Comments
Add Comment
2 min read
Comment utiliser les APIs EMR ?
Antoine Laurent
Antoine Laurent
Antoine Laurent
Follow
Mar 24
Comment utiliser les APIs EMR ?
#
api
#
automation
#
aws
#
dataengineering
Comments
Add Comment
8 min read
Are ClickHouse JOINs Slow? A 2026 PR-by-PR Analysis
Manveer Chawla
Manveer Chawla
Manveer Chawla
Follow
Apr 15
Are ClickHouse JOINs Slow? A 2026 PR-by-PR Analysis
#
data
#
analytics
#
dataengineering
#
database
6
 reactions
Comments
Add Comment
18 min read
Building High-Performance Data Stacks: Vector Search, SQLite Ops, & Open-Source Monitoring
soy
soy
soy
Follow
Mar 27
Building High-Performance Data Stacks: Vector Search, SQLite Ops, & Open-Source Monitoring
#
database
#
sql
#
dataengineering
1
 reaction
Comments
Add Comment
4 min read
đź‘‹
Sign in
for the ability to sort posts by
relevant
,
latest
, or
top
.
We're a place where coders share, stay up-to-date and grow their careers.
Log in
Create account