We talk a lot about "Serverless Postgres," but what does that actually mean for your daily workflow? By separating storage from compute, Neon gives you a database that actually behaves like modern software: 1. Autoscaling: It scales up when your traffic spikes and scales to zero when it doesn’t, so you don’t pay for idle compute. 2. Branching: Create full copies of your database in seconds. Use them for development, preview deployments, or safe AI sandboxes. 3. Point-in-Time Recovery: Treat your data like code. If something breaks, just roll back to the exact second before the error happened. All the Postgres you love, with none of the infrastructure headaches. Check out the breakdown in the video below!
More Relevant Posts
-
If you're new to Neon Postgres, I've put together a quick little explainer video on what exactly "Serverless Postgres" means.
We talk a lot about "Serverless Postgres," but what does that actually mean for your daily workflow? By separating storage from compute, Neon gives you a database that actually behaves like modern software: 1. Autoscaling: It scales up when your traffic spikes and scales to zero when it doesn’t, so you don’t pay for idle compute. 2. Branching: Create full copies of your database in seconds. Use them for development, preview deployments, or safe AI sandboxes. 3. Point-in-Time Recovery: Treat your data like code. If something breaks, just roll back to the exact second before the error happened. All the Postgres you love, with none of the infrastructure headaches. Check out the breakdown in the video below!
-
Post #4 in my observability series is live.

Last week we saw how poor observability hurts an organization. This week, let's look at the key components of observability. In this blog I cover the three classic pillars, plus a key fourth pillar: "Events". https://lnkd.in/e-7fQwKN

Comment if you see any other factors!!

#Observability #SRE #DevOps #PlatformEngineering #EngineeringLeadership Minutus Computing
-
We scaled a FastAPI service from 2 pods to 8. Throughput didn’t change. No errors. 😑 No alerts. 🤔 Just… no improvement. 🥲 The issue wasn’t Kubernetes. It was one early decision: synchronous database calls where async should have been used. But this isn’t an “always use async” post. Both have a place. Knowing when to use each is what separates systems that scale from ones that quietly bottleneck. Have you run into this before? https://lnkd.in/eVhQyBPW
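The details are in the linked article, but here is a minimal sketch of the failure mode, assuming psycopg2 and asyncpg as the drivers and an invented orders table; none of the names are from the post itself:

```python
# Hypothetical endpoints illustrating sync-vs-async DB calls in FastAPI.
# The "orders" table and DATABASE_URL are illustrative assumptions.
import psycopg2   # blocking driver
import asyncpg    # async driver
from fastapi import FastAPI

app = FastAPI()
DATABASE_URL = "postgresql://app:secret@db:5432/app"

@app.get("/orders/blocking")
async def orders_blocking():
    # Anti-pattern: a synchronous driver call inside an async endpoint
    # blocks the event loop, so concurrent requests queue behind this
    # one and adding pods barely helps once each loop is saturated.
    conn = psycopg2.connect(DATABASE_URL)
    try:
        with conn.cursor() as cur:
            cur.execute("SELECT count(*) FROM orders")
            (count,) = cur.fetchone()
        return {"orders": count}
    finally:
        conn.close()

@app.get("/orders/async")
async def orders_async():
    # The async driver yields to the event loop while waiting on the
    # database, so one worker can overlap many in-flight requests.
    conn = await asyncpg.connect(DATABASE_URL)
    try:
        count = await conn.fetchval("SELECT count(*) FROM orders")
        return {"orders": count}
    finally:
        await conn.close()
```

Under an I/O-bound load test, the second endpoint keeps serving other requests while it waits on Postgres; the first makes every pod a single-file queue.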
-
I recently worked on optimizing a backend system where API latency had become a serious bottleneck. At peak load, response times were inconsistent, affecting user experience and overall system reliability.

Instead of scaling blindly, I focused on identifying the real issues:
• Inefficient database queries (missing indexes, heavy aggregations)
• Repeated calls to external services
• Lack of caching for frequently accessed data

Here’s what I did:
→ Optimized MongoDB queries and added proper indexing
→ Introduced caching for high-read endpoints
→ Reduced unnecessary external API calls
→ Refactored parts of the service for better separation of concerns

Result: ~40% reduction in latency and much more stable performance under load.

One thing I’ve learned: Scaling isn’t always about adding more resources — sometimes it’s about fixing what’s already there.
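For illustration, a rough sketch of the first two fixes using pymongo; the collection, field names, and query shape are my own assumptions, not details from the post:

```python
# Hypothetical example: compound index + caching for a hot read path.
from functools import lru_cache
from pymongo import MongoClient, ASCENDING, DESCENDING

client = MongoClient("mongodb://localhost:27017")
orders = client.shop.orders  # illustrative database/collection names

# 1. Compound index so per-user, newest-first lookups stop scanning
#    the whole collection.
orders.create_index([("user_id", ASCENDING), ("created_at", DESCENDING)])

# 2. Cache a read-mostly query instead of hitting Mongo on every call.
#    (lru_cache has no TTL; a real service would likely use Redis.)
@lru_cache(maxsize=1024)
def recent_order_ids(user_id: str) -> tuple:
    cursor = (orders.find({"user_id": user_id}, {"_id": 1})
                    .sort("created_at", DESCENDING)
                    .limit(20))
    return tuple(str(doc["_id"]) for doc in cursor)
```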
-
If you want to learn how databases really work behind the scenes, check out this free YouTube series by Ben Dicken. He walks through the entire "Database Internals" book in video form, where you'll learn:
• How databases store data using B-Trees
• How storage engines work
• LSM Trees and write optimization
• Distributed systems and leader election
• and more

Over 10 hours of free content. Highly recommended for anyone into backend development.

Check it out: https://lnkd.in/gNNSbeWZ

#Databases #BackendDevelopment #SoftwareEngineering
-
AWS shipped something last week that I think is a bigger deal than people are giving it credit for. It's called S3 Files.

The short version: you can now mount any S3 bucket like a regular file system. ~1ms latency on cached data, the same bucket reachable through both the S3 API and POSIX, and it's already live in 34 regions.

The reason this matters is that most tooling — data science notebooks, build systems, log processors, even the bash scripts you wrote five years ago — expects a normal file system. Until now you'd copy data out of S3 onto EFS or a local disk, run your job, and copy it back. S3 Files just deletes that step. And if you can co-locate it next to compute, there's no data egress at all.

One nice side effect: agents get easier. A coding agent already knows ls, cat, and grep, so instead of writing a custom S3 tool wrapper, you can point it at a mount path and let it work the way it already wants to.

The bigger thing I'd zoom in on is the pattern. Object storage stays the source of truth, and a faster layer sits on top as a cache. We use the same shape at Mixpeek for vectors — S3 Vectors underneath, Qdrant on top, retrieval pipeline on top of that. Same thesis, applied to two different abstractions: object storage is truth, the fast layer is a cache.

mixpeek.com

#AWS #S3 #CloudInfrastructure #DataEngineering #AIInfrastructure #ObjectStorage
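To make the "same bucket, two interfaces" idea concrete, here's an illustrative sketch; the bucket, key, and mount path are placeholder assumptions (check the S3 Files docs for real mount setup), and only the boto3 call is a known, existing API:

```python
# Contrast: reading one object via the S3 API vs. via a POSIX mount.
import boto3

BUCKET, KEY = "research-data", "logs/2024-01-01.txt"  # hypothetical names

# The classic path: pull the object down through the S3 API.
s3 = boto3.client("s3")
body = s3.get_object(Bucket=BUCKET, Key=KEY)["Body"].read().decode()

# With the bucket mounted (mount point assumed here), the same object
# is just a file, so anything that expects POSIX (grep, pandas, old
# bash scripts, coding agents) works without an S3-specific wrapper.
with open(f"/mnt/research-data/{KEY}") as f:
    body_via_mount = f.read()

assert body == body_via_mount  # same bytes, two interfaces
```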
-
We were at a breaking point. Our main orders table was hitting 100M+ rows, and query latency was spiking to 5 seconds. The team was already drafting a plan for database sharding.

We thought we had outgrown our infrastructure. We were wrong.

Our dashboard was crawling. Every time a user searched for their order history, the CPU on our RDS instance pegged at 99%. We assumed the data volume was simply too high for a single node to handle.

Before committing to the massive architectural overhead of sharding, we ran a simple EXPLAIN ANALYZE on the offending query.

What we found was embarrassing: a Sequential Scan. The database was reading every single row on disk to find a specific customer_id.

Instead of a multi-month migration to a sharded architecture, we added a composite index on (customer_id, created_at).

Old way: the DB searched the entire "library" for one book.
New way: the DB went straight to the "index" at the back of the book and found the page number instantly.

The results were immediate and saved us thousands in engineering hours and infrastructure costs:
Latency: dropped from 5s to 15ms.
CPU usage: fell from 99% to 12% under peak load.
Cost: we actually downsized our instance (vertical scaling in reverse!), saving 30% on our monthly cloud bill.

The lesson: scaling is a ladder. Don't jump to sharding or replication until you have absolutely mastered indexing and query optimization. Software engineering is often about doing the simple things right, not the complex things first.

#SystemDesign #Database #SoftwareEngineering #Postgres #Scalability #Backend
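A sketch of that diagnose-then-fix sequence, assuming psycopg2 and the table/column names from the post; the query itself and the DSN are invented for illustration:

```python
# Step 1: confirm the plan. Step 2: add the composite index.
import psycopg2

conn = psycopg2.connect("postgresql://app:secret@db:5432/app")
conn.autocommit = True  # CREATE INDEX CONCURRENTLY can't run in a tx

with conn.cursor() as cur:
    # Ask Postgres how it executes the slow query. A line like
    # "Seq Scan on orders" in this output is the smoking gun.
    cur.execute("""
        EXPLAIN ANALYZE
        SELECT * FROM orders
        WHERE customer_id = 42
        ORDER BY created_at DESC
        LIMIT 50
    """)
    for (line,) in cur.fetchall():
        print(line)

    # The fix: a composite index matching the filter + sort columns.
    # CONCURRENTLY avoids blocking writes on a 100M-row live table.
    cur.execute("""
        CREATE INDEX CONCURRENTLY IF NOT EXISTS idx_orders_customer_created
        ON orders (customer_id, created_at)
    """)
```

Re-running the EXPLAIN ANALYZE afterward should show an Index Scan in place of the Seq Scan.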
-
Most “Simple” APIs Fail the Same Way. Here’s What Actually Breaks First.

I took a minimal Flask API (one endpoint + PostgreSQL) and load tested it with k6 until it started to crack. The goal wasn’t to build a high-performance system, but to understand how a typical service degrades under load and what scaling actually means.

Setup:
• Flask app with Gunicorn (4 workers)
• PostgreSQL with a single indexed table
• k6 running on a local machine
• One endpoint: GET /data → simple SELECT query

What broke first (and why):
• CPU saturation - At ~350 req/s, CPU spiked to 100%. Response times increased linearly because the API logic itself (request parsing, JSON serialization) became the bottleneck before the database even saw the traffic.
• Database contention - After increasing workers, the bottleneck shifted. PostgreSQL showed high wait events and connection queueing. Why? Concurrent connections overwhelmed the database’s pool, so even simple reads slowed down under concurrency.

Tradeoffs I observed:
• Caching → reduced DB load by ~70% but added ~200MB memory overhead
• More workers → improved throughput until CPU became the limit again
• Database indexes → lowered per-query latency but didn’t solve concurrency spikes

Have you load tested your own services? What was the first bottleneck you hit?

#BackendEngineering #DevOps #SystemDesign #Performance #LoadTesting
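One common mitigation for the contention stage is a bounded client-side connection pool, so workers reuse connections instead of racing to open new ones. A hedged sketch with psycopg2's built-in pool; pool sizes, DSN, and the table are illustrative, not from the test:

```python
# Hypothetical version of the GET /data endpoint behind a small pool.
from flask import Flask, jsonify
from psycopg2.pool import ThreadedConnectionPool

app = Flask(__name__)

# Bounded pool: at most 8 connections ever hit Postgres, which caps
# the connection queueing the load test surfaced.
pool = ThreadedConnectionPool(
    minconn=2, maxconn=8,
    dsn="postgresql://app:secret@db:5432/app",
)

@app.get("/data")
def get_data():
    conn = pool.getconn()
    try:
        with conn.cursor() as cur:
            cur.execute("SELECT id, payload FROM data LIMIT 100")
            rows = cur.fetchall()
        return jsonify(rows)
    finally:
        pool.putconn(conn)  # return the connection for reuse
```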
-
🚀 HOW TO TAME S3 SHUFFLE AT SCALE

In the latest edition of Data Engineering Weekly, we break down the zero-to-one guide to optimizing your data pipelines.

🛠 What’s Holding Your S3 Shuffle Back?
🔹 Over-reliance on Local Disk: Traditional shuffle relies on local storage, which leads to "Disk Full" errors and prevents independent scaling of compute.
🔹 Ignoring S3 Request Limits: Without proper partitioning and pacing, you’ll hit S3 503 Slow Down errors.
🔹 Poor Data Serialization: Inefficient formats like Java's default serialization increase payload size, driving up network I/O costs and slowing down transfers.
🔹 Lack of Adaptive Execution: Static partitioning fails with data skew. You need Adaptive Query Execution (AQE) to handle skewed keys and coalesce small partitions (see the sketch below).
🔹 Neglecting Cost Monitoring: API calls (PUT/GET) can cost more than the storage itself. Without lifecycle policies to clean up temporary files, your AWS bill will snowball.

💡 Deep Dive: This breakdown was inspired by the insightful work of Manoj Babu and Meghanath Macha in their recent article on optimizing S3 shuffle performance.

🔗 Read the full breakdown here: https://lnkd.in/gPmjEuDv

#DataEngineering #AWS #S3 #BigData #CloudComputing #Spark #DataArchitecture #DataEngineeringWeekly
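For reference, the AQE knobs alluded to above are standard Spark 3.x settings; a minimal PySpark sketch, not configuration taken from the article:

```python
# Enable Adaptive Query Execution so Spark re-plans shuffles at runtime.
from pyspark.sql import SparkSession

spark = (
    SparkSession.builder
    .appName("s3-shuffle-demo")
    # Re-optimize the plan using runtime statistics, not static estimates.
    .config("spark.sql.adaptive.enabled", "true")
    # Merge tiny post-shuffle partitions (fewer small objects to push).
    .config("spark.sql.adaptive.coalescePartitions.enabled", "true")
    # Split pathologically large partitions caused by skewed join keys.
    .config("spark.sql.adaptive.skewJoin.enabled", "true")
    .getOrCreate()
)
```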
-
been running Neon in production for months and the branching is what sold me. I spin up a full database copy for every migration test before touching prod. scale to zero is nice for dev branches but branching is the killer feature nobody talks about enough