I'm using the AWS stack for http://www.soundslice.com/ and I've been using MySQL instead of Postgres, purely because my hatred for MySQL is less than my hatred for being a sysadmin. It was a tradeoff, and I miss Postgres dearly every time I use MySQL.
Yeah, I was using MySQL on RDS just to escape the Heroku monolith. Funny that Heroku announced a new Postgres pricing model that is VERY expensive compared to the old setup (https://news.ycombinator.com/item?id=6712570) just in time to be wiped off the map by RDS.
Last time I badmouthed Heroku on here I got a reply from one of their employees asking me to fill out support tickets for errors I was getting in their apps......
No one's watching. Your initial comment just really didn't add any value. Calling Heroku a monolith compared to Amazon is an understatement and I don't see Heroku being "wiped off the map" from this.
I wasn't calling Heroku a monolith because they're a huge company -- it's because they suck you in & then you're forever stuck in their web of services. I bet they start bleeding a lot of money now that there is an easy-to-use alternative for Postgres hosting on a well-known platform. Heroku epitomizes the idea of overcharge + vendor lock-in & people only put up with their high prices because they don't have the time to figure out a better configuration. Database is the easiest thing to jump ship with, you just have to change 1 URL.
It's a shame that Amazon has some proprietary idea of a machine image, but my ideal hosting scenario is just App Server Machine Image + DB + any Machine Images required for extra services. Shouldn't have to choose between several paid services that are just different APIs into an app server component you should have direct access to.
I don't understand how the open source community can be so into Heroku given that it's basically just wrapping open source software & charging for it, getting away with it by saying they're charging for the admin UI or whatever.
Just like AWS, you pay Heroku to simplify your system administration. You don't have to like Heroku's product but there is a place for it. I do not use Heroku but it makes total sense if you have an app you need to deploy and you don't know system admin, don't want to do system admin yourself, or can't afford a system admin.
GitHub just "wraps up" git and charges for it and I am happy to pay them. Setting up a git server is a royal pain in the ass. Setting up a wiki, issue tracker, etc to go along with a git server is even more of a pain in the ass.
Well done cloud services make life simpler, and many people will pay for that simplicity regardless of how the internals are built.
This is not true at all. Apps on Heroku are built on open source software (Rails, Postgres, etc). You can always decide to host it all yourself without too many issues.
How is this not true? It absolutely IS vendor lock-in, just in a slightly less sinister form.
If you add 5 services, then switching hosts isn't just another git push. You have to re-configure each of the services. If you'd done this yourself all along & made machine images, it would have been less convenient at the time, but probably cheaper & a good learning experience. There are many other hosting platforms that offer Linux boxes, so you could move your whole App/Software layer to another of these companies without much trouble.
I'm not saying Heroku is evil or anything. Yes, they provide a good platform. But I think any web company with a significant customer base would benefit more from the cost savings & freedom of a purer platform than the conveniences of Heroku.
Plus, there are so many configuration issues with their services. I have auto-scaling set up with Adept and still I see these long request-queue buildups now & then. I get the feeling I would not have the same issues with an AWS stack where I have CPU usage monitors that are very transparent & all the networking is trivial.
I still use Heroku as an app server & don't hate it enough to up & move (though a large factor in this is I'm not the one footing the bill, the client is...) but anything I can easily get on to AWS is a no-brainer. Database is one of those things -- a couple clicks to scale up/down every year is all that's really required.
CONCLUSION (cuz I rambled too much): I think Heroku offers scaling/convenience but AWS is just so rock solid & cheap that you can probably just buy larger instances than you need (to compensate for scaling) and have much better performance at the same price. Then you just need to learn how to install your tools & take a machine image as backup. Plus there's a lot of value in learning how to work with machine images that goes way beyond hosting a web app.
Yeah, I feel you on that, but on Heroku I feel like "Every service I add is a per dyno credit card charge that is about to be multiplied by the number of hours in a month". :(
I prefer the Amazon model because you can stick these all on one server & as long as CPU isn't pegged at 100% you're good. I agree tho I'd rather not spend the time figuring it out. For now I only use it for RDS & for services not available on Heroku (some Adobe streaming server stuff).
This is almost the complete opposite of vendor lock-in.
Vendor lock-in is when you write your software on Oracle or MSSQL and moving away requires you to rewrite your whole thing. It's not losing the convenience of "git push" for deploys and having to spend time moving off their hosted versions of open source software and configuring and hosting it yourself instead.
Accusing Heroku of practising vendor lock-in is honestly absurd.
IMHO vendor lock-in is anything that makes part of your work process specific to a given vendor. The harder it is to move from one vendor to another, the deeper you are "locked in".
And as a point of interest, I think these days it's probably much easier to convert your database than it is to switch hosting platforms (well... in some cases).
We started off on Heroku because we wanted something dead simple. The amount of time Heroku saved us was an incredible value when we were starting out. I don't think there is any vendor lock-in. We just did not switch to AWS earlier because our needs were met really well by Heroku, and even Amazon Elastic Beanstalk (for Ruby) did not come near the ease of a Heroku deployment. Once OpsWorks came around, we invested in deployment scripts and switched, because OpsWorks gives us the same ease of use and greater control of our stack.
What do you think stops you from switching? We had no issues at all - definitely none from Heroku.
We still use PostgreSQL from Heroku because it is still a solid service and comes with niceties like dataclips. I should confess that I have not explored the Amazon PostgreSQL offering, but I am happy with Heroku for databases at the moment.
Heroku is great for clients though. When I was a freelancer, I built these apps for other companies that assumed that I would do the admin & hosting. Heroku is a great way to just "tack on a fee" for doing the hosting. If they want a cheaper option, they can always do it themselves. Most clients don't care about the savings on hosting.
I've heard about PostgreSQL and know that HN community raves about it, but am currently using RDS with MySQL.
Does it make sense to migrate to PostgreSQL, I don't have a lot of data as I'm in the early stage?
What are the primary advantages that PostgreSQL provides over MySQL?
Any advice/pointers appreciated.
Perhaps you mistakenly insert "2013-10-32" into a date column. MySQL will silently convert this to "0000-00-00" (!!). Postgres will raise an error.
Perhaps you make an error in a transaction. MySQL lets you keep doing subsequent things in the transaction. Postgres treats the transaction as invalid and forces you to start over.
Of course, there are things you can do to make MySQL less horrible, and this is a generalization. But Postgres is just more respectful and more solid.
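One way around the silent-coercion problem, if you're stuck on non-strict MySQL, is to validate dates application-side so they fail loudly the way Postgres would. A minimal sketch (plain Python, no database required; the function name is made up):

```python
from datetime import date

def parse_iso_date(value):
    """Reject impossible dates up front, the way Postgres would.

    Non-strict MySQL would instead silently store "0000-00-00".
    """
    year, month, day = (int(part) for part in value.split("-"))
    return date(year, month, day)  # raises ValueError for e.g. day 32

try:
    parse_iso_date("2013-10-32")
except ValueError:
    print("rejected")  # Postgres-style behaviour: fail loudly
```

Of course this only guards one code path; the real fix is a database that refuses bad data in the first place.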
Oh, and PostGIS (Postgres' geo add-on) is by far the best open-source geospatial database. If you're doing anything with geographic queries, you need to be using it. MySQL's stuff is laughable in comparison.
Context: I've dealt extensively with both databases, both from the perspective of a framework author (Django) and a developer making products. I've used both databases on and off since 2001.
Hey at least it throws a warning on the bogus date conversion (MySQL warnings should almost always be treated as fatal errors).
The thing that kills me with MySQL (technically it's with InnoDB-based storage engines in MySQL) are the subtle quirks. Like the thing where it insists on writing temporary tables to disk if you do a query that selects TEXT or BLOB fields. Even if they could have easily fit in memory, it's not smart enough to be able to determine that with variable length fields. A very non-obvious performance killer unless you're specifically looking for it.
I run a small Wordpress network. MySQL's insistence on going to disk for joins on tables with a TEXT field (even if the query doesn't touch those fields) is probably the major performance bottleneck.
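The usual workaround is to keep TEXT columns out of the sorting/joining query entirely and fetch them by primary key afterwards. A sketch of the pattern (demonstrated with Python's bundled sqlite3 so it runs anywhere; table and column names are made up -- on MySQL the point is that step 1's implicit temporary table contains no TEXT/BLOB columns, so it can stay in memory):

```python
import sqlite3

conn = sqlite3.connect(":memory:")
conn.executescript("""
    CREATE TABLE posts (id INTEGER PRIMARY KEY, title TEXT, body TEXT);
    INSERT INTO posts VALUES (1, 'b-title', 'long body 1'),
                             (2, 'a-title', 'long body 2');
""")

# Step 1: sort/filter on narrow columns only, never touching the TEXT field.
ids = [row[0] for row in
       conn.execute("SELECT id FROM posts ORDER BY title LIMIT 10")]

# Step 2: fetch the wide TEXT column only for the rows you actually need.
placeholders = ",".join("?" * len(ids))
rows = conn.execute(
    f"SELECT id, body FROM posts WHERE id IN ({placeholders})", ids
).fetchall()
print(rows)
```

It's more round-trips, but it keeps the expensive part of the query off disk.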
Ouch. Thanks for the explicit heads-up. I generally stay with PostgreSQL, but I honestly thought most of these things were "fixed" in recent versions of MySQL (and assumed my subconscious dislike of MySQL was at least partly irrational/rooted in Ancient and Outdated Lore). I guess not.
It took me a long time to learn this one. I suppose if I'd read the MySQL docs from cover to cover I would've found it earlier.
One other problem that popped up was ignoring indexes on tables with TEXT fields during joins, which was a planner weakness. I understand it was fixed in 5.6; I'm waiting for the Percona version to stabilise before I upgrade.
I tried MariaDB about a year ago and it had the same problem. It's possible it's been fixed since. I personally prefer the Percona fork of MySQL, which has some performance tweaks yet is basically a 100% drop-in replacement.
I don't think it's fair to say MySQL doesn't care about your data. For example,
> Perhaps you mistakenly insert "2013-10-32" into a date column.
Only with ALLOW_INVALID_DATES sql mode set. As of 5.0.2, the server requires by default that month and day values be legal, and not merely in the range 1 to 12 and 1 to 31.[1]
> Perhaps you make an error in a transaction. MySQL lets you keep doing subsequent things in the transaction.
If you care about transactions you should have STRICT_TRANS_TABLES on.[2]
Thanks for the updated list. It's been a LOOOOONG time since I compared MySQL vs PostgreSQL in detail (around 2001), but what I found at the time made me never want to look at MySQL again.
Understand, MySQL-ers, I know that your DB has been patched A LOT over the last decade, but running with something that did not support ROLLBACK (nor isolation), nor foreign key constraints???
That the MySQL team even thought they could call such a thing a database terrified me, and made me quite scared to ever trust their judgement. (Viewing the comments for this article suggests to me that playing those odds was the right thing to do, as well, rather than simply "prejudice".)
That, and at the time, PostgreSQL was supporting stored functions that fit anywhere in SQL statement syntax that the return type matched the needed expression type (scalar, vector/row, matrix/table), and could be written in a PL/SQL work-alike OR alternate loadable languages, while MySQL had no stored procedures at all.
That, and at the time (already), PostgreSQL supported OOP-ish "extension" tables that extended other tables with extra, specialized, columns. Rows in the specialized, subclass, table would show up in the generalized, superclass, table (sans extra columns), but the subclass table would only show the relevant specialized type rows, with non-null columns where needed. Other DBs required you to join 2 tables and manage joins and a view to do this.
Putting SQL syntax on top of an ISAM engine just makes it dBase with awkward syntax, and that's not an environment I wish to revisit. (I know that InnoDB is constantly twiddled to suck less, but that back-end was extra back in the day, yes?)
I think the biggest advantages are trust and flexibility.
Trust, because PostgreSQL language semantics are cleaner, closer to the SQL standard, and less likely to surprise you. And PostgreSQL just has a good reputation for traditional engineering quality.
Flexibility, because it offers a lot of APIs and features that can be very useful to adapt your application as needs change. You don't have to go crazy with features, but even simple apps can benefit a lot from prudent use of them -- a trigger here, a foreign table there, or LISTEN/NOTIFY (for cache invalidation) can just save you a huge amount of work and make the system more robust overall. The extension mechanism is very powerful.
Before making any big decisions, do a trial migration and see what you think.
- More robust, fewer crashes, less corruption of data
- More features (JSON data type, partial indexes, function/expression indexes, window functions, CTEs, hstore, ranges/sequences/sets, too many to list)
- More disciplined (doesn't do things like auto-truncate input to get it to fit into a column)
- Not owned by Oracle, it's actively developed, regular major release schedule, etc.
- Better Python driver (don't know about other languages)
- Choice of languages for database functions/procedures (Python, JS, etc.)
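To give a flavour of two items from that list (CTEs and window functions): the syntax below is the same in Postgres; the demo uses Python's bundled sqlite3 (which also supports both, in 3.25+) purely so it's runnable without a server.

```python
import sqlite3

conn = sqlite3.connect(":memory:")
conn.executescript("""
    CREATE TABLE sales (day TEXT, amount INTEGER);
    INSERT INTO sales VALUES ('mon', 10), ('tue', 20), ('wed', 5);
""")

# A CTE feeding a window function: running total of sales by day.
query = """
WITH ordered AS (
    SELECT day, amount FROM sales
)
SELECT day,
       SUM(amount) OVER (ORDER BY day) AS running_total
FROM ordered
"""
rows = conn.execute(query).fetchall()
print(rows)
```

Doing a running total without window functions means a correlated subquery or application-side loops, which is exactly the kind of work these features save you.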
DbVisualizer (http://www.dbvis.com/) works with lots of databases and is really pretty cheap. Disclaimer: no connection other than being a long-term customer; I use it with Oracle, MySQL and PostgreSQL.
The basic answer is (from what I can tell) that postgres ships with sane defaults. Lots of little gotchas exist in mysql that experienced mysql dbas know to deal with and avoid. On the more sophisticated side, pgSQL seems to have a focus on being "Really Awesome For Experts" where MySQL seems to be focusing on "Being Easy to Get Started".
I've been using pgSQL for my side project and for my "personal tooling" at work, and I can honestly say that it's just as easy for me as MySQL was for the same sort of things.
> Does it make sense to migrate to PostgreSQL, I don't have a lot of data as I'm in the early stage? What are the primary advantages that PostgreSQL provides over MySQL? Any advise/pointers is appreciated.
If MySQL works for you, there is no reason to change. Some people have quite a dependency on Postgres (hstore, JSON, transactional DDL, pubsub, PLPGSQL etc.)
Ordinarily I would agree, but in this case, the poster said that there's not a lot of data yet so it would be easier to migrate. Choosing a database is a pretty major decision with long-term impact that might not be obvious now, so it's worth exploring a few options if you have that luxury.
I think there are two times when it makes sense to consider a DB migration: (1) early on, when it's easy; (2) if you are in major trouble with your existing DB.
Our dependency on Postgres is more to do with it being a lot safer with our data. It doesn't silently fail, the transactions have a better failure mode, and as previously pointed out it handles ALTER in production a LOT better than MySQL.
The combination of safety, speed/efficiency in schema/index alteration, and its increasingly good performance are why we depend on it. The rest is just gravy.
Transactional DDL alone makes life so much easier, but there is much more: materialized views, advanced data types, efficient JOINs, pg_stat_statements, PLs, full-text search, GIS features (pretty useful for nearby queries), fuzzy search, ISO SQL standard compliance, piece-of-cake replication.
Do some research and give it a try one weekend. Trying it is loving it.
Hate to pile on, but doing so anyway: one of the overlooked things, I'd argue, is that PostgreSQL tries much harder to let you do neat things with your data without much hassle. Need to write a custom function to process data? Not only does PostgreSQL have support for PL/pgSQL, but it's built in a way that has let people write plugins for Python, Perl, V8, Tcl, etc., so you can do that data transformation in the database instead of shipping it back and forth to another server with the latency that brings. The recent version also brings read/write support for Redis, so you can have your updates trigger Redis to store new values, etc. Plus, since it's not controlled by Oracle, there's much less of a feeling of the community being held back; MySQL feels very much like Oracle's tying one hand behind its back, so that when you get to the size where you need a "real" database, Oracle will gladly step in, at their standard consulting fees, of course.
I have but one suggestion. Unless RDS for PostgreSQL is drastically different from RDS for MySQL, you still need a DB administrator.
RDS removes remarkably little of the pain of running a database instance (most of the pain that's removed is just the up front setup), and ends up adding a lot of inconveniences for your day-to-day operations.
Also don't count on their replication as your backup.
> Also don't count on their replication as your backup.
This statement seems like a complete non-sequitur: does anyone credible recommend replication as a backup strategy? It's like dinging a server vendor because you can't rely on RAID as a backup plan.
Any link to a page providing more info on those quirks? I'm deciding between a traditional EC2 instance running Postgres or RDS at this very moment!
The long and short of it is that you do not have an account with super access (you can't perform normal actions like killing threads, viewing processlists, changing variables, etc). You instead have to use special stored procedures to perform basic administrative functions.
Also, you can't take advantage of replication (aside from their own read replicas within other RDS instances) or binary backups. Anything that requires access to the machine itself is impossible except through Amazon's support channels.
It's gotten better than it was, but it's still a headache to monitor and manage as a DBA.
The guide does mention replicating from a read replica (an intermediate RDS instance between the master and your offsite slave), but I've had no trouble replicating directly from the master instance.
One thing they don't cover is replication over SSL. AWS had failed to mention this shortcoming in the docs as of the last time I checked. To have MySQL replicate over SSL, the master and slave both need an SSL certificate signed by the same CA, which would require you to obtain a cert+key signed by the AWS RDS CA.
Of course you have the option of tunneling the replication connection into a haproxy or stunnel running on an EC2 instance, but that has its own shortcomings. You can't use the ELBs, since you can't register the RDS instance with an ELB.
Am I missing something or not following correctly? You can view the processlist and kill queries like normal, and you can edit variables via their control panel. I'd assume you can't edit every variable.
Disagree with what? Keeping your own backups? Probably not - if you're not keeping offsite backups, you're begging to go out of business.
With needing a DBA, even if you're still on RDS? I don't see what that has to do with PCI compliance.
With RDS only removing the up-front setup pain, at the cost of ongoing maintenance... as someone who is also familiar with PCI (and HIPAA, and DOD) compliance, I respectfully disagree with your disagreement (well, if you're working with DOD, AWS isn't even an option to begin with).
Given the choice between running my own DB on an AWS instance (which carries the same certifications as RDS) and using RDS, I will run my own instance every time. The setup just isn't onerous enough to justify the daily productivity cost.
I disagree with the notion that RDS removes remarkably little of the pain of running a database instance.
Yes, there are projects where RDS is not a great solution, but it definitely simplifies a lot of stuff. The notion that it "only removes up-front setup pain" is silly. If you manage your databases correctly, up-front setup pain should be the vast majority of all your basic admin operations. The "at the cost of ongoing maintenance" part is a real head scratcher for me. RDS basically gives you everything you'd have with a DB on an AWS instance except a local login, which one tries to avoid using like the plague anyway.
> The "at the cost of ongoing maintenance" part is a real head scratcher for me
Let's look at a common problem that DBAs are typically given: "The database is slow!". Let's troubleshoot this fictitious problem on RDS:
Am I being affected by a noisy neighbor? Can't tell; contact Amazon support.
Can I look at top to see if the load is high on the box, and potentially why? No. I can look at historical trends, but not with enough granularity or information to be useful.
Can I look at the disk iops to see if there's any kind of problem there? No. Complete black box here; contact Amazon support.
Can I look at the slow log? Kind of. They'll push the slow log data into the database for you to query, but then you can't use tools to do aggregate tracking.
Pause for a moment for a quick MySQL RDS tip: pt-query-digest has a mode of operation that lets you do a processlist every 1/100th of a second and turn that into a pseudo slow log, which does work for RDS.
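The idea behind that trick -- aggregating rapid processlist samples into an estimate of where query time goes -- can be sketched in a few lines. Here `digest_samples` is a hypothetical stand-in for what pt-query-digest's processlist mode does properly (the real tool also normalizes queries, tracks users, and much more):

```python
from collections import Counter

def digest_samples(samples, interval_s=0.01):
    """Aggregate repeated processlist sightings into estimated query time.

    Each sample is the list of queries observed running at one instant;
    seeing a query in N samples ~= N * interval_s seconds of runtime.
    """
    seen = Counter()
    for sample in samples:
        seen.update(sample)
    return {query: count * interval_s for query, count in seen.items()}

# Simulated samples taken every 10ms: the slow query shows up repeatedly.
samples = [
    ["SELECT * FROM big_table WHERE unindexed = ?"],
    ["SELECT * FROM big_table WHERE unindexed = ?", "SELECT 1"],
    ["SELECT * FROM big_table WHERE unindexed = ?"],
]
estimates = digest_samples(samples)
```

It's statistical sampling rather than a true slow log, but on RDS it's often the best you can get without writing your own tooling.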
Back on track - so no real analysis of a historical slow log, without writing your own tools. Possible, but time consuming.
Can I kill queries? Yes, using a stored procedure. Can't use any of the existing toolset around this (like pt-kill, which can help keep poorly written ad-hoc queries from getting out of hand).
So, after many hours swapping emails with Amazon support, we've determined that we're actually spending a lot of time waiting on malloc mutexes. The internet says that using a non-default version of malloc will help with that - can I do that?
Nope. You're stuck.
Other things you can't do:
* Offsite backups that are in any form but MySQL dumps.
* Take advantage of new index types and compression support from TokuDB.
* Zero downtime failovers (We were able to help someone fake this; it was a PITA).
* Cross-region replication.
* Automated failovers using a reputable tool (MMM, MHA, etc).
* Access the error logs.
* Run multiple instances on one machine.
* Alter the disk elevator (I/O scheduler; hopefully they're using something sane, like noop, but we'll never know)
* Alter the kernel swappiness.
* Troubleshoot crashes.
* Monitor and alert on a machine's vitals.
Now perhaps I'm just being a power-hungry admin, but these small things matter. They are the difference between a snappy DB which scales beautifully to 10,000+ QPS, and a sluggish DB that causes you to move to bigger hardware, because it's the only option open to you.
Databases just aren't that hard to set up. Install packages, install config files, start the DB, restore from a backup file, restart the DB, and you're golden. If you're particularly paranoid, set up the selinux contexts (I'd bet dollars to doughnuts that this isn't done on RDS instances), create a security group that limits access on ports 22 and 3306 to your application hosts only, and set up individual users.
This is particularly simple when you use an orchestration tool; I recommend Ansible personally.
> Am I being affected by a noisy neighbor? Can't tell; contact Amazon support.
Sure, you can. Spin up multiple RDS's and benchmark them.
> Can I look at top to see if the load is high on the box, and potentially why?
If you are using top to monitor your box, you are already screwed. There is lots of support for remote monitoring.
> Can I look at the disk iops to see if there's any kind of problem there?
Disk iops are part of the built in monitoring and metrics provided with RDS.
> Can I look at the slow log? Kind of. They'll push the slow log data into the database for you to query, but then you can't use tools to do aggregate tracking.
If only there was a tool that could extract records from a database and compute aggregates...
> So, after many hours swapping emails with Amazon support, we've determined that we're actually spending a lot of time waiting on malloc mutexes. The internet says that using a non-default version of malloc will help with that - can I do that?
>Nope. You're stuck.
MySQL sucks. RDS provides no means to make it any better. Fortunately they do now provide PostgreSQL.
> Offsite backups that are in any form but MySQL dumps.
You can do that by replicating to an external MySQL server and doing whatever the heck you want with it.
> * Take advantage of new index types and compression support from TokuDB.
Yup. Until today it was also really hard to take advantage of different engines found in PostgreSQL. ;-) This is a totally different product.
In general, all of the stuff you are describing are features, not things that cause maintenance complexity. In fact, manipulating those things causes maintenance complexity.
> Databases just aren't that hard to set up. Install packages, install config files, start the DB, restore from a backup file, restart the DB, and you're golden.
I had no idea PCI compliance could be that simple. ;-)
> This is particularly simple when you use an orchestration tool; I recommend Ansible personally.
Yes, orchestration tools, if set up properly, are exactly how you'd want to do this kind of thing. If you already have all that set up to manage your database, RDS is likely not going to help.
I think this is great news... PostgreSQL is definitely my favorite open-source database. It's also nice to see Amazon get into the game, as hosted pg options have been fairly limited. I am slightly disappointed not to see the server-side JS support baked in, and that apparently you can't do reads from distributed replicas. Just the same, I think there will be a lot of progress in this area.
Administering databases is a full-time type of responsibility. Yes, you can get pretty sane defaults and be up and running without much difficulty with MS-SQL, and MySQL has been a de facto standard in the LAMP stack. That said, PostgreSQL has been a rock-solid RDBMS. The commercial extensions for replication have been cumbersome and expensive. Here's hoping that AWS will grow/expand the replication options/features, and that they'll grow to include JS procs as that feature stabilizes.
It looks like the Multi-AZ setup is using block level replication such as DRBD instead of the built-in replication:
> Database updates are made concurrently on the primary and standby resources to prevent replication lag.
Makes me feel better about setting up my own pg cluster on EC2 a week ago, which does allow reads from the replication slave. Plus, I can provision <1000 IOPS (provisioned IOPS is damn expensive with AWS), and get to use ZFS.
The time spent researching and building that cluster was time not spent building product. If you are low on capital then it was potentially a good tradeoff, but I'd much rather just scale up my read capacity by adding horsepower until they get around to adding read replicas. My goal is to spend as little time as possible administering my infrastructure, and running my own database cluster is the last thing on my mind.
From a very quick glance, at the small end the Amazon options are a little more than half the price (Ireland single-AZ instance pricing, and even less reserved), but multi-AZ options are probably very similar.
I don't actually know whether Heroku can failover across an AZ failure.
By default, our followers and HA are automatically cross-AZ. You also have the ability to create followers across regions, but we do not automatically fail over to those due to latency.
One area we're focused on is delivering more guidance and expertise around what you're doing with your database, in addition to ensuring your database is healthy and running. An example of this is notifications we deliver around unused indexes, places where you may benefit from other indexes, or other places where you can quickly optimize your DB. This starts to free up a DBA for higher-value tasks, or for smaller shops lets you get by longer without needing a DBA.
Another big area is features we deliver on top of Postgres. This ranges from followers, which allow you to easily scale read traffic or replicate your database across not just AZs but also regions, to dataclips at the other end of the spectrum. Dataclips make it easy to share data in a simple way, as well as build richer dashboards by integrating with Google Docs, or quickly prototype APIs.
If you're curious on various technical details we'll be documenting that soon but would be happy to correspond via email, craig at heroku.com
I understand that "forks" and "follows" are easy concepts, stats are cool, etc., but I personally wouldn't want to pay double or triple for that (and I'm not a DBA, so I feel the pain). Not that my word counts for much as I'm on the smallest Heroku production plan, so I guess my bill isn't exactly interesting for this kind of discussion. In my case, I would say that even saving $25 off $50 can be offset by these additional niceties. But I don't know what I would think if I were a customer paying $1000/mo.
I just switched off Heroku after using it for a couple of years (sorry! I wish I could have stayed). I will attest dataclips are really cool, but you guys' pricing is out of whack (and customer support is eh).
> From a very quick glance at the small end the Amazon options are a little more than half the price (Ireland single AZ instance pricing and even less reserved) but multi-AZ options are probably very similar.
Did you add disk? People seldom do, I can attest it's a big part of the bill...
> I don't actually know whether Heroku can failover across an AZ failure.
"Followers" have preferred another AZ for quite some time -- almost since the beginning -- and Heroku Postgres HA is based on similar technology. So, yes.
> No, I didn't take storage or IOPs into account but you do get more memory and the flexibility if the Heroku options don't fit you.
No doubt about that. To date, Heroku's product model has preferred fewer, common choices rather than more flexibility. Heroku's staff sees fit to give in sometimes, but coarsely speaking, that's pretty much how it goes.
One other issue to consider is that AWS Micro DB Instances have "low" I/O capacity and no provisioned IOPS.
Last I heard Heroku is hosted on top of AWS. Does anyone know if Heroku's cheapest Postgres plan is hosted on a dedicated AWS Micro instance, or do they buy large AWS instances and host multiple databases on each box thereby potentially providing more IO performance?
IIRC, you get a dedicated database, not a dedicated PG cluster, so presumably there will be multiple plans hosted on the same underlying server instance.
The Multi-AZ options aren't similar, they're half the price as well (comparing a Multi-AZ RDS setup with a Heroku Postgres instance that has an equivalent follower instance.)
Yeah I was disappointed to see this.. Can you configure pgpool externally to talk to the replicas or are those multi-az instances a black box? I've never used RDS before so I'm curious as to how much flexibility is afforded.
Hmm, interesting development. But am I the only one to see a huge gap in services between Amazon and Heroku, and one which most certainly bodes ill for Heroku? Specifically, with Heroku's recent PG 2.0 service, their 'production grade' Standard plan ranges from $50 - $3500, but with "Up to 1 hr downtime per mo."?! Really? With up to an hour of downtime per month, you couldn't be serious about the product that runs atop this tier - a non-starter for me and most other SaaS businesses. Heroku's cheapest high-availability option starts at $200/month, still with "Up to 15 minutes of downtime per month"... this is a concern for me.
Now, comparing to Amazon, their '1.7 GB memory Small DB, 1-year reserved, multi-region' is around $28/month (with storage & transfer for my app, no more than $35/month). The equivalent Heroku plan, Tengu (1.7 GB mem), STARTS at $350/month! Wow, now I'm really rethinking my hosting platform... Amazon looks more attractive, even if I have to do a bit of sysadmin for my web server/cloud server.
Heroku will still be my go-to for quick tests and experiments. But for larger projects it would be foolish not to consider Postgres on RDS + Docker on AWS (assuming they both leave beta).
Doesn't their RDS add-on merely set DATABASE_URL to the URL that you provide when adding it? Are you sure it doesn't work with MySQL? There's no reason you'd need to wait for Heroku to add support for this, just spin up an RDS instance in us-east-1 and start using the URL Amazon gives you as your database URL.
I'm on the Heroku Add-ons team. This is correct. The RDS "add-on" is there simply for people who search for it. In reality, it's just setting DATABASE_URL, and you can do the same yourself.
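Concretely, the whole trick is one config var. A sketch, where the RDS endpoint below is a made-up placeholder, not a real instance:

```shell
# Hypothetical RDS endpoint; substitute the one the AWS console gives you.
DATABASE_URL="postgres://myuser:mypass@mydb.abc123.us-east-1.rds.amazonaws.com:5432/mydb"

# The Heroku RDS "add-on" does nothing more than the equivalent of:
#   heroku config:set DATABASE_URL="$DATABASE_URL"

# Sanity-check which host the URL actually points at:
host="${DATABASE_URL#*@}"; host="${host%%:*}"
echo "$host"
```

Moving off later is the same one-line change in reverse, which is why the database is the easiest piece to migrate.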
this is awesome!
Still, I miss one essential feature compared to MySQL on AWS and Heroku Postgres: there is no replication for read replicas. Yes, you can deploy a "hot standby" replica in another availability zone for failover, but you can't read from it.
This isn't true anymore; RDS now exposes an external replication stream for MySQL. AWS actually doesn't seem that interested in vendor lock-in -- they usually build what people ask for.
Somehow it seems pretty costly... Wouldn't it be cheaper for me to just run it on the virtual instance that's hosting the app and back it up to S3?
If you place no value on your time, then yes, it would be cheaper to just run it on your own EC2 instance. There's a lot to be said for not having to worry about the day-to-day maintenance of a database server, though.
I've only ever used S3, but some of these AWS offerings do look interesting for my little projects. My question is, how do hours get calculated for billing? If I had my super low traffic blog using RDS, would I incur a few microseconds of time per DB hit, or is it rounded up to an hour, or is it the total time the DB is available period?
You would be paying for 24 hours each day. You are renting a managed virtual server, and you pay for each hour this server is running in addition to usage fees for storage and IO operations when it's actually being accessed. It'd be $20-30/month to host the database for a very small blog.
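To make that concrete, here's a back-of-the-envelope calculation. The rates are rough, assumed 2013-era figures (they vary by region and instance class), not quoted prices:

```shell
# Rough monthly RDS bill for an always-on instance.
# Rates below are illustrative assumptions; work in cents to avoid floats.
HOURS=$((24 * 30))                    # billed every hour it runs: 720
instance_cents=$((HOURS * 25 / 10))   # assume ~$0.025/hr = 2.5 cents/hr
storage_cents=$((5 * 10))             # assume 5 GB at ~$0.10/GB-month
total_cents=$((instance_cents + storage_cents))
echo "~\$$((total_cents / 100))/month before I/O charges"
```

The point is that the instance-hour term dominates for a low-traffic site: you pay for uptime, not for queries.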
Will anyone please explain the tactical reasons why PostgreSQL won? It's pretty obvious it has. I've basically ignored the database wars for a few years, so it's kind of interesting to see that everyone's using PostgreSQL now.
Some engineers I worked with went and interviewed people at various San Francisco startups about their experiences with their databases.
The MySQL startups tended to say "We love MySQL. We've gotten in the habit of taking an hour or two of downtime in the middle of the night every week to run all of our schema migrations, and we've had to build our process around that, but once we had it in place, everything's been fine."
The PostgreSQL startups said "We love PostgreSQL. We run schema migrations in real-time during the middle of our workday, and we don't have any problems."
pt-online-schema-change from Percona addresses this particular point with MySQL nicely for us, incidentally. We use their Percona XtraDB Cluster fork and are quite happy.
Entirely possible to use pt-online-schema-change on an RDS instance. It's just running a bunch of SQL commands and is very happy to do them against a remote database like an RDS instance.
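For reference, a hypothetical invocation against a remote host looks something like this (the host, user, database, table, and column are all placeholders; `--dry-run` validates the plan without touching data, and you'd switch it to `--execute` when ready):

```shell
pt-online-schema-change \
  --alter "ADD COLUMN last_seen DATETIME" \
  --host mydb.abc123.us-east-1.rds.amazonaws.com \
  --user admin --ask-pass \
  D=mydb,t=users \
  --dry-run
```

Under the hood it builds a shadow copy of the table, backfills it in chunks, and swaps it in, which is why it works fine over a remote connection to RDS.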
For starters, the requirement of a third-party library introduces at least two problems:
1. An additional potential point of failure
2. The core software (a DB, in this case) can (and probably will) evolve independently of the third-party tool--thus introducing an additional layer of maintenance problems.
I'd argue further--and this is of course just an opinion--that such a basic feature as this ought to be supported out-of-the-box by anything that claims to call itself a "database" in the sense that MySQL does.
Percona and pt-online-schema-change are not just "some third-party library".
pt-online-schema-change is an industry-standard tool, and Percona's is one of the most respected forks of MySQL. It's not a layer of maintenance problems, and they sell commercial support.
Looking through the documentation, though, it's even worse than saying every ALTER is an offline ALTER. There are a few, but significant, schema changes one might want to make -- like anything involving a table with full-text search -- that still have to be done offline; you have to tread really lightly with online ALTERs. I'd personally argue that online ALTER is not ready for prime time in 5.6; there are too many other things they need to do with the storage engines to make it really useful.
I don't know if I would really say Postgres won. Among people who care about databases, Postgres has always been the more popular of the big two free databases. Postgres was actually dedicated to being a good relational database, whereas MySQL in its earlier days was willing to cut corners to the point where it wasn't even ACID in most common configurations. But MySQL was faster (because, again, cutting corners) and its tooling was a little more friendly (especially for the PHP crowd).
Over the years, Postgres has made up those shortcomings, so it retains its respect among database wonks and now other people can easily use it too, so you see a lot of people adopting it over the past few years.
But in overall usage numbers, I would be surprised if it were anywhere near MySQL. It takes a long time to overcome that kind of inertia.
It is a bit surprising how long it took for that to land. On the plus side, replication is very easy and low latency with 9.3.
There's still a bit to do on the auto-failover front, but that may end up being more of a third party undertaking. Postgres has failover facilities included, they just have to be driven by something external right now.
We ran PostgreSQL in production before 2010, and there were several replication solutions back then, but all of them had their own set of problems. We went with shipping the write-ahead log to the standby server via rsync, but this was trickier to set up and had much worse latency than the current built-in solution (which basically streams the write-ahead log over a TCP connection).
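Roughly, the two approaches look like this in Postgres configuration (hostnames and paths here are placeholders):

```
# Old-style WAL shipping, on the primary (postgresql.conf):
archive_mode = on
archive_command = 'rsync -a %p standby:/var/lib/postgresql/wal_archive/%f'

# Built-in streaming replication, on the standby (recovery.conf, 9.x era):
standby_mode = 'on'
primary_conninfo = 'host=primary.example.com port=5432 user=replicator'
```

With streaming, the standby pulls WAL continuously over a normal TCP connection instead of waiting for whole 16 MB segments to be archived and copied, which is where the latency improvement comes from.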
That is absolutely ridiculous. Less than 1% of 1% of mysql users are using replication. Suggesting it is a "total non-starter" for most production uses is crazy.
>Using a single, unreplicated database instance in production for anything serious is bizarre
It is incredibly common. Go check out a thousand businesses running mysql, you'll be able to count the ones using replication on your fingers.
>Failed hardware is hardly unheard of.
You don't need to use mysql replication to deal with that. Even the crappiest low end SAN storage devices do it vastly better than mysql does, without any of the bugs and problems mysql replication has.
I have heard stories of entire SANs failing, so I would not advise trusting your SAN too much. For example, there was a huge outage at a Swedish host caused by a failed SAN; all data was lost, so people had to restore from daily backups.
Replication can be done off-site, so you lose at most a couple of seconds' worth of data. I don't know anything about MySQL's replication, but my trust in PostgreSQL's is very high.
I'd wager that a huge number of MySQL users are using replication. Running a single database in a production environment is totally unacceptable and a major business continuity problem.
>Running a single database in a production environment is totally unacceptable and a major business continuity problem.
You don't need to use mysql's broken replication to get HA. Hell, I've seen more people (wisely) using DRBD for that than using mysql's replication. But even entry level storage devices do replication.
Given the number of companies that will provide you with a fully managed Postgres installation, there's no reason not to start out on it for the MVP -- as a user it's no harder than MySQL. Even as a sysadmin it's not significantly harder, just different to work with.
Tactically, because PostgreSQL predictably releases robust software every year with major improvements and innovations.
Strategically, because PostgreSQL is not just trying to be a free database checking off features. It's trying to be something better -- lots of innovation that is having a bigger impact on what developers and DBAs can do.
One of many reasons is MySQL being bought by Oracle, with the name change, loss of community backing, etc., along with some missing advanced technical features.
I still use MySQL because trying out PostgreSQL recently scared me away. Maybe I was trying the wrong thing but I couldn't get pgAdmin to display my tables. I think I sank most of my afternoon trying unsuccessfully to do something MySQL can do in 5 minutes.
I recognize I'm used to MySQL but I was under the impression that because PostgreSQL has a similar syntax (SQL) it wouldn't take too long to pickup.
Unfortunately, things that take 5 minutes in a complex application typically require an order-of-magnitude larger up-front investment in reading manuals and gaining experience.
I deal with Postgres a good bit, but will be the first to tell you that pgAdmin has got some issues. It's a very clunky, buggy, moody piece of software.
I've had better luck learning the psql command/shell than messing with pgAdmin, at least in certain cases. To take your example of displaying tables: open psql and type \dt
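A few other psql meta-commands worth knowing (the table name here is hypothetical):

```
\l        list databases
\dt       list tables in the current schema
\d users  describe a table's columns and indexes
\?        help on all meta-commands
```

Once you know \? exists, psql is largely self-documenting.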
I've used Sequel Pro and I don't understand why others love it; it's as clunky as anything I've ever used. My preference is SQLyog, and my dream is for it to be ported to OS X...
I have always said Postgres would become much more widely adopted if it were released on RDS and... they finished Sequel Pro support. Halfway there ;)
Coming from a MySQL background, I sort-of assumed PostgreSQL worked the same way. I remember after setting up the server, pgAdmin was very difficult to navigate. I managed to get a database and table made and got some data in (using SQL) but I couldn't view any of the data. Compared to MySQL + Sequel Pro it was very difficult to use for a beginner.
Postgres is very powerful and can do a lot. I think I just need a good tutorial and some time to get used to it.
None of that has anything to do with PostgreSQL. pgAdmin is crappy; don't use it. Use one of the dozens of database-independent admin tools, or just stick with the best tool: psql.
You must be joking right? Either that or spending too much time on HN. PostgreSQL has won in the same sense that Golang or Clojure have won: in frequency of frontpage HN articles.
I think it all depends how you define winning here. Based on popularity alone - yeah, MySQL is still the king. Based on capabilities - I think it's fair to say that pgSQL is more capable and it's just that not everybody needs that one.
And even though I love PostgreSQL (and was working with all major databases), I still think that the real winner is sqlite :)
I'd venture a guess and say that PostgreSQL didn't win; SQLite and MongoDB won. I think a lot of the reason behind MySQL's initial success was blogs and various PHP apps that weren't architected properly (from a DB perspective). Some of those didn't need a ("proper") database, and most of them probably did but "worked well enough" without one.
I never really understood why you'd want to use a separate dbms if it couldn't do proper constraints, triggers, transactions and materialized views for you -- then it starts to feel like a lot of wasted effort.
So for many of the use cases where MySQL might have been appropriate, we now have SQLite, MongoDB, memcached/Redis and a few others.
Personally I don't really see any reasonable use cases for MongoDB, just as I didn't see many reasonable use cases for MySQL -- not that you couldn't build stuff on top of it, just that it wasn't a very good idea.
IMO it was when Oracle bought Sun (and MySQL with it) and effectively killed development. MySQL still lives on in MariaDB, I suppose, but in the years since, Postgres has leapfrogged it in terms of functionality.
I'd say it's because MySQL was sold to a company with no reason to keep it healthy. That's one of the biggest reasons for existing MySQL users to move on.
It was already in use, so it's still around. But people drop it once they finally learn why safety as a design philosophy matters, by paying for it in data corruption or service outages. (Though some people never learn…)
Safety is annoying. Safer systems usually generate more errors to prevent accidental mistakes. An extremely safe system will refuse even to boot if it hasn't been properly configured. It simply has a much more annoying safety net. If you care about safety, these annoying errors are a good sign; but if you're a newbie, they're just a big obstacle that makes for a steep learning curve.
Really. We still see many people who hate wearing a seatbelt because it's annoying. And everybody did, before the benefits of seatbelts were widely known and accepted.
Also, the number of users says nothing about how well a product works. Usually the cheapest product takes the biggest user base, and MySQL is the cheapest to start with, precisely because of its lack of safety. Search this thread for the name "natural219" and see why he chose MySQL over PostgreSQL. I believe that's why most people started with MySQL in the first place.
And surprisingly, there are so many people who really don't care about data safety. (Maybe that's not so surprising; we see those people on TV all the time…)
Inertia, largely. Postgres is more scalable and has quite a few nonstandard additions that are insanely useful from time to time, but it's not always a drop-in MySQL replacement. If you're just writing something simple like a blog or CMS then you don't need the extra functionality.
Plus Wordpress has a hard requirement for MySQL, and like it or not a huge number of projects still use it as a framework.
I wish I lived in your world. Where I am, 90%+ of web developers believe there are two options for making web sites: "LAMP" and .net/sql server. I am constantly dealing with web developers who genuinely don't even know postgresql exists.
I hate PostgreSQL. Yes, I am a dummy application developer who doesn't understand database software.
Every time I try to install PostgreSQL it fails. Every time I install MySQL it installs successfully with no problems. Actually, that's the extent of my experience with it, and I guess I'm fine dealing with a database that doesn't validate date formats strictly if I can use the damn database without hassle. I am totally fine using Postgres at a company or with another DB developer who knows how to set up databases properly, but if I am starting a new project, I am going to use MySQL, period.
If you're on a Mac and don't want a menubar icon (I decided this recently), `brew install postgres`, and then write yourself some functions to make starting and stopping easier:
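For example, a rough sketch of such functions (the data directory below is Homebrew's default at the time and may differ on your machine):

```shell
# Drop these in ~/.bashrc or ~/.zshrc; paths assume a Homebrew install.
pg_start() {
  pg_ctl -D /usr/local/var/postgres -l /usr/local/var/postgres/server.log start
}
pg_stop() {
  pg_ctl -D /usr/local/var/postgres stop -m fast
}
```

Then `pg_start` and `pg_stop` from any terminal, with no menubar icon or launch agent involved.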
EnterpriseDB has a nice set of PostgreSQL installers for all platforms. It also bundles PgAdmin 3 if you need a GUI. It's not as hip as PG Commander, but it's a real client that can do pretty much anything (vs PG Commander where you can't even create a new db instance).
Postgres.app is nice when it works. When you install the latest version the psql tool it bundles and exposes in the menu assumes you have a db that matches your user name, which, of course, doesn't exist, so you can't connect to the Postgres :-)
It also doesn't work that great if you already have another version setup running on the default port.
So postgresapp.com's installer is great, but due to some xyz thingie, it caps out at 27 or so connections in 9.2 (maybe still in 9.3? I haven't followed the GH issue). That startled me when I ran into it and motivated me to move to a regular install.
Um, the point of this announcement is that you no longer have to install PostgreSQL! You can let Amazon do it for you, and scale it, back it up, maintain it, etc.
Are you sure that you aren't confusing installation with initial configuration?
The most utterly infuriating part of a new Postgres install is getting authentication set up. I shouldn't have to expressly edit a freaking INI file for basic user access, damnit!
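For what it's worth, the file in question is pg_hba.conf (whitespace-separated fields, not actually INI). Typical local-development entries look like this -- an illustration, not a security recommendation:

```
# TYPE   DATABASE   USER   ADDRESS        METHOD
local    all        all                   peer
host     all        all    127.0.0.1/32   md5
```

`peer` trusts the local OS username over Unix sockets; `md5` requires a password over TCP. Most "can't connect after install" frustration traces back to these two lines.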
With the tentative upsert changes in 9.4's commitfest, initial user configuration is my last major problem with Postgres...
No kidding. If you can really avoid all of MySQL's caveats, installing PostgreSQL is nothing hard. It's just your belief that you can avoid all those caveats.
Also, why don't you use Heroku PostgreSQL, which is just a one-click, self-contained service?
If you're on a FreeBSD/Linux desktop, how can you fail to install it? Of course it doesn't work right out of the box; it needs manual initial configuration for security. But that shouldn't be hard for any Unix-family server developer.
If you're on a Windows desktop, why don't you run your own FreeBSD/Linux server VM for development?
If you're using a Windows server… with MySQL… hmm, then I'm sorry. I have no more ideas.
When I start a new company project, one of the first things I do is make sure other devs can easily work on it. Encourage your tech lead to always make a script that installs and sets up necessary requirements.
Makes it very easy if you are a Mac or Debian-based shop -- or if you just use Vagrant.
Currently this looks like:
1. Make sure we have the version of Ruby we want
(if not, use rvm to install it)
2. Make sure our DB is installed
(if not, use brew -- after making sure we have brew -- to install it)
3. Create our database and users on the new DB server
4. Run rake db:setup
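The steps above can be sketched as a script like the following. The Ruby version, database names, and the brew/rvm assumptions are all illustrative, not prescriptive:

```shell
#!/bin/sh
have() { command -v "$1" >/dev/null 2>&1; }

# 1. Make sure we have the Ruby we want (via rvm, if installed)
if have rvm; then rvm install ruby-2.0.0; fi

# 2. Make sure the database server is installed (via Homebrew, if present)
if ! have psql && have brew; then brew install postgresql; fi

# 3. Create the app's database and role (names are made up here)
if have createdb; then
  createuser -s myapp 2>/dev/null || true
  createdb -O myapp myapp_development 2>/dev/null || true
fi

# 4. Load the schema and seed data (ignore failure outside the project)
if have bundle; then bundle exec rake db:setup || true; fi

echo "bootstrap done"
```

Checking for each tool before using it is what makes the same script serve the Mac/Homebrew case and a Vagrant/Debian box alike.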
I'm using the AWS stack for http://www.soundslice.com/ and I've been using MySQL instead of Postgres, purely because my hatred for MySQL is less than my hatred for being a sysadmin. It was a tradeoff, and I miss Postgres dearly every time I use MySQL.
This new Amazon offering solves that.
I wrote a little more about my AWS setup here: http://www.holovaty.com/writing/aws-notes/