The Lava Layer Anti-Pattern (2014) (mikehadlow.blogspot.com)
106 points by jxub on April 12, 2018 | 75 comments



I sometimes see legacy systems as old cities:

- The classic old town (the older parts of the system that tend to use older coding practices and technologies, but have most of the bugs stamped out and more or less "just work")

- The slums (the parts that tend to be bug prone, but are impossible to fix. I.e. no one wants to touch that code)

- The apartments/row houses (parts of the code that involve lots of the same types of objects/classes/modules that follow a similar pattern)

- The art district (the place where someone tried some odd/experimental libraries or code patterns)

- The residential district with windy roads and lots of courts and loops (places in the code where there are lots of objects calling each other with really deep stack traces; easy to get confused where you are when debugging)

Legacy systems are bound to have inconsistencies, but as you become more familiar with a system you begin to notice localized instances of consistency. These localized instances of consistency become the "districts" in our mental map of the system.

To my mind, if you're working on mostly new code (within a large existing system), then unless it is already a highly consistent code base, I would just use whatever conventions make the most sense to the developers working on it.


Ah, I see: Le Corbusier just wanted to rewrite it in Rust.


I love this. I think I might add some tour-guide comments to our codebase.


"You probably shouldn't leave any private methods parked near here."


"private methods are routinely broken into and used for nefarious purposes"

that'd describe quite a lot of code I've seen.


"Over there is our haunted module - it's been empty for years, and the callbacks aren't even wired up anymore, but people swear they've seen breakpoints triggering in the upstairs windows during late night solo debugging sessions"


I really, really like this.

I've long seen microprocessors and/or circuit boards as graphically (and functionally) similar to cities.


Last year’s failed rewrite attempt that caused half the team to quit can be that part of town swallowed by a sink hole.


And then there is the highly guarded government/bureaucratic district (the deployment scripts, workflows, jobs, and configs, which are as much a part of the system as the code).


I often feel lost when reading such articles because they never seem to quite acknowledge how bad legacy systems can be. The following examples all originate from the same project I have worked with in the past.

Example 1: You cannot "favor consistent over new/better" because one of the main problems with the old code is that it is horribly inconsistent already.

Example 2: You cannot "favor consistent over new/better" because the legacy style you would try to be consistent with is so bad that you cannot even understand tiny fragments of the code, let alone write new code in a consistent way.

Example 3: The legacy system was built in a way that makes it impossible to store the code in any kind of VCS, so being consistent with that means breaking well-established best practices.

Example 4: For some part of the system, nobody knew how to make the magic code generators produce code that is consistent with the legacy code (and if you try to write that code without the generators, you are in for a trip to hell).

That said, the article helped a lot in that I now know the name for a problem I somewhat recognized but couldn't describe well.


    The legacy system was built in a way that makes it impossible to store the code in any kind of VCS
Come again?


It's not stored in text, or even in files at all. The program exists only in the form of holograms encoded into crystals which can only be altered by singing at a precise frequency.


I've seen:

- The program can only be compiled in a particular Eclipse environment that is copied from one machine to another. Every attempt to replicate the environment from a plain Eclipse installation fails.

- Building the program depends on specific versions of system executables. Those versions are long outdated.

- The program depends on specific and very complex system settings for building and running. Those are comparable in size to the source code.

And I mostly do not work with legacy systems.

EDIT: Oh, and there is the obligatory "90% of the executable source lives in the database" kind.


Binary formats, such as ladder logic in control systems, which also typically can only be edited by a first-party program.


My experience: a smallish PHP application originally made by two autodidacts who didn't dig too far into programming or the tooling around it (weirdly enough, I found some pretty advanced SQL in there, while it apparently didn't bother them to write the exact same query seven times, five lines apart from each other).

There were/are files scattered all across the filesystem, with wild include statements both at the top of each script and in the execution logic. The scattering was/is so bad that data files are mixed with logic files, and dependencies (which were not registered as such, and had been modified from their original source) are thrown in between the original scripts.

I started versioning a year ago when I got there, and I only got to a _good_ state last month, trimming and trimming.

Just the fact that files were scattered across the filesystem, while not constituting big enough clusters to warrant separate repositories, meant I had to make one large repo with weird ignore rules.


VCS requires that projects build from source code that is more or less immutable except when humans really want to change it. But in some projects the sources get mutated in machine-dependent ways by the build system. Bonus points when said data is binary. I've seen CMake caches and Eclipse project files go wrong this way.

Another good one is where your development project is testable only against a real database, and the schema keeps changing. Even if you could revert the schema file, you can't revert the data.


Maybe it's in something like Smalltalk?


Another alternative is one of the PICKs, where the source code is (or perhaps used to be) stored in the database. Or, from the little I saw, COBOL on some systems didn't exactly lend itself to flat files (or at least the OS didn't embrace CVS/SVN/Git, favouring some hugely expensive and utterly inferior product instead...).


Ah, right, or "business logic eval()ed from database text" systems.


Or, more commonly - stored procedures.


Stored procedures are a little easier to handle, provided your deployment system does a wipe/replace, similar to overwriting scripts/executables.
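A sketch of what I mean by wipe/replace (hypothetical deploy step: the procedure bodies live as .sql files in the repo and the deploy simply re-runs them; this assumes a dialect with CREATE OR REPLACE, e.g. PostgreSQL, and a DB-API connection such as psycopg2's):

    import pathlib

    def deploy_procedures(conn, procs_dir="db/procedures"):
        """Re-create every stored procedure from the files checked into the repo."""
        # e.g. a psycopg2 connection; its cursors work as context managers
        with conn.cursor() as cur:
            for path in sorted(pathlib.Path(procs_dir).glob("*.sql")):
                # Each file holds a single CREATE OR REPLACE FUNCTION/PROCEDURE,
                # so re-running the deploy is idempotent and the repo stays the truth.
                cur.execute(path.read_text())
        conn.commit()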

I was thinking of a grimmer scenario, where the system has a disquieting aspect of polymorphic, run-time code editing... central to its "flexible" production behavior. There are more things in heaven and earth than are dreamt of in sane philosophy.


Smalltalks (at least Pharo) have integrated version control


Nothing is impossible of course because it is all bits.

But perhaps they are using something like Oracle BI, where you assemble some blob in a UI on Windows and then upload it to the server.

You could commit the blob to VCS. Or maybe serialize it to XML (supported in some versions).

But try and convince the DBAs doing development to do that commit. Good luck.


Ha ha! I went through a phase of my career where I thought it would be cool to write code generators that took input from the BNF in RFCs. Luckily I was just smart enough to declare that code generation was a one time only event. Current me would have been very worried about how to survive young me's creative programming spirit ;-)


> they never seem to quite acknowledge how bad legacy systems can be

Isn't that the ground truth that everybody starts with, though? My general perception is that everybody "knows" that legacy systems are bad, hates them, and doesn't want to work with them. I view the article as attempting to moderate that base position and inject some caution by highlighting the often-unforeseen costs of certain approaches to replacing legacy code.

I think the OP would respond to all the cases you're referring to by advocating either a full rewrite (if they really are as bad as your observations above), or leaving the code alone (if the cost of a full rewrite or carrying a partial rewrite is higher than the cost of living with the old code).


I get lost with all the Microsoft / data modelling terminology, it doesn't have much to do with software I've ever had anything to do with. What is all this "data-access layer" abstraction stuff about anyway?


It's based on the (false) belief that two sections of the code base accessing/updating the same data will be performing the same function, so it should all live in one place. This one place is the data access layer, and as more exceptions to that belief turn up, more and more "business logic" seeps into the data access layer. Then your business layer is anemic, because most of the "logic" has moved into the data access layer, and it's just an unnecessary pass-through from the application layer.

Some places will be very strict about this design pattern and require the application layer to make multiple calls to the BLL and DAL, typically resulting in a problem known as N+1, where the application layer makes tens to thousands of separate calls to the database per request instead of just doing it with an inner join.
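For anyone who hasn't been bitten by it, here's a minimal sketch of the N+1 shape versus a single join, in Python with sqlite3 (the tables are made up for illustration):

    import sqlite3

    conn = sqlite3.connect(":memory:")
    conn.executescript("""
        CREATE TABLE orders (id INTEGER PRIMARY KEY, customer TEXT);
        CREATE TABLE order_lines (order_id INTEGER, item TEXT);
        INSERT INTO orders VALUES (1, 'alice'), (2, 'bob');
        INSERT INTO order_lines VALUES (1, 'widget'), (1, 'gadget'), (2, 'sprocket');
    """)

    # N+1: one query for the list, then one more query per row.
    orders = conn.execute("SELECT id, customer FROM orders").fetchall()
    for order_id, customer in orders:
        lines = conn.execute(
            "SELECT item FROM order_lines WHERE order_id = ?", (order_id,)
        ).fetchall()
        print(customer, [item for (item,) in lines])

    # The same data in a single round trip with a join.
    rows = conn.execute("""
        SELECT o.customer, l.item
        FROM orders o
        JOIN order_lines l ON l.order_id = o.id
    """).fetchall()

With a handful of rows it doesn't matter; with a layer boundary per call and thousands of rows per request, it very much does.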

Advanced implementations will have these layers physically separated on different machines, in the mistaken belief that more CPU will fix this performance problem; modern implementations will call them microservices. Of course nobody measures the impact. By this point any hope of transactionality is lost.

So instead of "messy" inline sql or ORM calls you end up with 7 layers doing nothing between you and the database and showing 10 items in a list on a web page takes 54 seconds.

And that, ladies and gentlemen of the jury, is why I didn't brake when I saw the software architect crossing the road.


Thanks for that. I was thinking maybe it was just different versions of things like ODBC, but that sounds far worse. The database is already supposed to be the "data abstraction layer", and should be perfectly capable of handling multiple requests from different places.

I suppose software that just passes around database handle(s) would be a work of barbarism to these people.


This just sounds like the inner-platform effect when applied to database applications. :/


The inner-platform effect is different, but teams will often do both.

The inner-platform effect is usually data driven, in a "we'll just code this once and generate dynamic code" sort of way. The dream of the inner platform is often to automate the production of these 23 useless layers.


You just described my previous workplace a little too well...


Your architecture may say exactly what all of the qualities of your data are, but when it comes to transferring it into and out of a database, some of the fidelity is lost.

If you're trying to pull the data into a statically typed language (even one that isn't object based) there's an impedance mismatch and you almost have to trick your language into accepting the data. So you hide this indignity in something called an Object Relational Mapper, and the rest of your app just pretends like you have a database that gives you Objects.

There are a million flavors of these, each with their own jargon, but it's all the same stuff, and they are all within an order of magnitude of each other on the awfulness scale (which is why I'm not too enthusiastic every time someone comes up with essentially the same thing under a different name. Like, say, Protocol Buffers).


Agreed, ORMs all suck because what they're trying to do is essentially sucky. But some are better at sucking than others by orders of magnitude!

https://www.sqlalchemy.org/
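For example, here's a minimal sketch (a made-up table, using the 1.x-era declarative API) of why I rate it: you can lean on the ORM where it helps and drop to plain SQL where it doesn't:

    from sqlalchemy import Column, Integer, String, create_engine, text
    from sqlalchemy.ext.declarative import declarative_base
    from sqlalchemy.orm import sessionmaker

    Base = declarative_base()

    class User(Base):
        __tablename__ = "users"
        id = Column(Integer, primary_key=True)
        name = Column(String)

    engine = create_engine("sqlite:///:memory:")
    Base.metadata.create_all(engine)
    Session = sessionmaker(bind=engine)

    session = Session()
    session.add(User(name="alice"))
    session.commit()

    # Use the ORM where it helps...
    users = session.query(User).filter(User.name == "alice").all()
    # ...and drop straight to SQL where it doesn't.
    rows = session.execute(text("SELECT id, name FROM users")).fetchall()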


Print this out and post it on a wall where you see it every time you leave your desk or come back:

Refactoring is a bottom up process.

You make local changes, and those reveal the paths of least resistance in the code. Contiguous refactors start suggesting further refinements or even new features and it spreads and spreads across the app.

By the time you are making structural changes to the app, the avalanche should have already started and it is too late for the pebbles to vote. If that isn't happening for you then put it down to impatience borne of frustration with the rate of change.

One of the best ways I know to speed this process up without violating the 'rules' is to start with the build scripts and work your way through the initialization code of the app, chipping away at smells until it starts looking right. With good code to the 'left' you have a beachhead (and a line in the sand) you can use to push out into various subsystems improving as you go.


It's the reality of a large codebase that there will be parts that followed the best practices of 5 years ago, best practices of 10 years ago, and so on; that's not an anti-pattern (indeed I'd be horrified if the code from 5 years ago wasn't noticeably worse than today's code - that would imply that the industry and the team had made no progress in the past 5 years).

What makes the single supporting example given for this supposed "pattern" bad is that it's full of churn: parts rewritten in a different way that wasn't better, just different. The idea that this is an anti-pattern relies on the fallacy that all choices are trade-offs and there's no such thing as a better way of doing things. Whereas actually, e.g., NHibernate is enough of an improvement over DataSet that an application that's half NHibernate and half DataSet is much, much nicer to work on than one that's all DataSet, despite the inconsistency.

The real antipattern in the story is making technology choices without team consensus/buy-in. If one developer adds a code-generation framework that only they can maintain, it should be rejected during code review. There's very little value in one developer unit testing on their own if the rest of the team doesn't care about maintaining tests. That's the real problem, and none of the suggestions in the article address it.


In The Mythical Man-Month, Fred Brooks asserts that conceptual integrity is the single most important quality of a large, successful software project. Although I was skeptical at first (it seemed a little too believable to be true), experience has led me to believe that Brooks is correct. If a change makes a project conceptually incoherent, it should be rejected and a new project started. I think this article is one very good illustration of this phenomenon, albeit in an agile rather than a waterfall setting.

As a corollary, I think people often use agile as an excuse for introducing conceptual incoherence. I think this is almost always a mistake, and represents laziness, short-sightedness, and immaturity on the part of devs and managers alike.

edit: I'll add that I think under-investment in quality principal engineers is what gets one into this mess in general. If you replace your architect and her copilot with a handful of code monkeys and one or two arrogant senior devs, you get crap stew, unless you're careful to manage around your team's lack of experience and maturity.


I was thinking of making a comment here, describing this as precisely my experience. I work on an old legacy system. I have done so for some time, from junior to lead. [Edit: wait, I meant to say "i have discovered this same principle but you expressed it better than me". instead, i just described my experience.]

And I can see from experience that every refactor we did that was "this is the latest and greatest and we'll write the new feature like this and we'll just start migrating everything over to this eventually" has been an utter failure.

But the changes that were, by their nature, spread throughout the system automatically have been so much easier to work with. You don't even notice them. (Which makes it harder to get credit for them. If I have to train a new dev up today, I can say "the system is a little crappy, but we're trying to make it better". But they'll never see that we've made huge progress. What they will see is the four different database abstractions we've got going on, and they'll curse me, because that's where the work shows.)

It is better to have a crappy core that you gradually fix than to have a crappy core that stays put plus five other crappy cores, one from each different dev. And whatever problems are caused by the shitty database abstraction some idiot dev created are always going to be there, so you might as well just live in that world.


I really appreciate your taking the time to write up your experiences in response, thank you. I think it's really important to continually articulate hard, possibly unpopular truths learned through experience. Otherwise how will our children's children have anywhere to look for guidance (lol... but seriously)


Consistency is also the reason why Java and C# are so popular for large systems. I recently asked a Scala developer why his company uses Java rather than Scala, despite the developers' familiarity with the latter. He answered that while Scala is fine for small teams, once you have 100 or more developers there will be factions (scalaz, cats, etc.), and it's very hard to maintain consistency between systems. With Java there is only one true style, so it's easier to scale to a larger number of developers.



Just curious, but why not automate your link to the previous discussion? Why does Hacker News rely on people like you to post links to previous discussions? If there have been three discussions of an essay over the course of eight years, why not list all three of them when someone posts it again in the year 2021?


This already exists -- the "past" link under the submission title.


I don't think the 'past' link conforms to current best design patterns. We should refactor it so that it automatically posts a comment on the article.


Someone caring enough to look and post the links is a high-pass filter for interestingness.


Isn't the same link being posted over and over already such a filter?


Maybe of some kind, but not the same.

I'm reluctant to introduce mechanical postings for anything because it's important for the content here to have variety and not be predictable. And also to be related to the people in the community.


And thank you for this. Despite my gripes with it, HN is indeed a community, not a 'platform'. Things like this policy keep it that way.


I came into a position where my first job was to make a certain process scalable. The code had all the smells of bad design: huge monolithic classes in a huge monolithic solution full of unrelated projects. A less mature me would have said this crap needed to be rewritten. Now I would like to consider myself more practical:

1. From the monolithic repo, start taking out unneeded projects and classes, recompiling often.

2. Encapsulate the entire executable in a Docker container

3. Use AWS’s ECS, Fargate, and Autoscaling.

Now we have scalability.

For maintainability, every time you touch a part of the code, extract the functionality into a Lambda microservice.

The code that never changes doesn't need to be touched, and you slowly start decreasing the size of the monolith; it becomes easier to find bottlenecks and make changes without affecting other parts of the code. Replace "Lambda" with microservices/separate modules etc. as appropriate for your use case.


> For maintainability, every time you touch a part of the code, extract the functionality into a Lambda microservice.

Funny; for maintainability, every time I touched a Lambda microservice, I would integrate it into a single codebase.


I like to think you are at the same company, doing a modern day version of this: http://kinosjourney.wikia.com/wiki/Three_Men_on_the_Rails_—O...

Spoilers (really, even if you aren't into animation I promise this episode of Kino's Journey is worth it and stands on its own; turn back now): the first man is polishing the tracks, the second man is removing them, and the third is laying down new tracks. Each was hired years apart, and with large distances and no communication between them, none knows that the others continue to work behind him, undoing his life's work.


The end goal is to get rid of servers completely. By keeping the services small you enforce a culture of small, loosely coupled, single-purpose functionality. This is especially helpful when you're dealing either with junior developers or with developers who have been at one company for 10 years and never learned how to properly structure code.


In my (albeit limited) experience, projects which consist primarily of lambdas suffer one of two problems:

- the code is awful, because lambdas were an excuse to keep bad developers in their own playpens (aka juniors and those who don't learn)

- the code is just fine, and would be easier to maintain if most of the lambdas were combined back into one or more "monoliths"


I agree: if you don't have junior developers (or, worse, outsourced developers), it is easier to maintain a well-constructed monolith with separate, focused modules and clear boundaries between them. Microservices help contain the damage of poor programming skills.


I guess my point wasn't that damage containment was a perk. I view it instead as a lack of support, either cultural or structural, from the more experienced developers.

Microservices/FaaS may help mask it, but you're still stuck with a lot of bad code... only now there's less oversight and/or accountability.


Yes, but the biggest issue with maintaining "bad code" is that it's brittle. You make a change in one place and it breaks something else downstream. With a microservice, the invariants are well known and the boundaries are clear. It's easy to know whether you are introducing a breaking change, and you can just create a new version at a different endpoint.
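As a rough sketch of what "a new version at a different endpoint" can look like (a hypothetical Flask service; the point is that v1 callers keep their contract untouched while v2 changes it):

    from flask import Flask, jsonify

    app = Flask(__name__)

    @app.route("/v1/orders/<int:order_id>")
    def get_order_v1(order_id):
        # Original contract: the flat shape existing callers depend on.
        return jsonify({"id": order_id, "customer": "alice", "total": 42.0})

    @app.route("/v2/orders/<int:order_id>")
    def get_order_v2(order_id):
        # The breaking change lives at a new endpoint; v1 keeps working as-is.
        return jsonify({
            "id": order_id,
            "customer": {"name": "alice"},
            "total": {"amount": 42.0, "currency": "USD"},
        })

    if __name__ == "__main__":
        app.run()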


You're not wrong, but I think you're missing my point. This is how we end up with a hackathon Golang microservice saving a company $50k a month when a simple correction to poorly written code would have done the same thing.

Granted, I only skimmed bits of that article, so I'm not sure those were the exact details, but it makes for a nice analogy.


Irreducible complexity doesn't go away, and it needs orchestration no matter how loosely coupled the parts are (which usually turns out to be simply a conversion from control flow to data flow, with little else added).


There's a difference between irreducible complexity and 9000-line "Manager" classes. When you're stuck with either outsourced developers or a bunch of local contractors - neither of which is worth training, since the time investment will be wasted when they leave - it limits the damage.

Microservices also help if you have a large project with a lot of developers.


Interesting article, but it seems to leave out some really important points:

1. Tests

I really like Michael Feathers' definition of legacy code as code without automated tests. I've worked with some pretty bad legacy code bases, but the ones with decent test coverage were mostly easier to change and to refactor in smallish steps.

2. Documentation

Good documentation describing the reasons for the architectural decisions made (and, somewhat more importantly, the reasons why other ways of solving the same problem were dismissed) could have prevented most of the bad choices made in that story.

3. Management

Where was management in the story? They should have seen the red flags (high turnover on that project, ...), done at least exit interviews with people leaving, recognised the risk of the tech debt in the code base, and taken appropriate action.


At a previous job I inherited a steaming pile of legacy that needed improvement (it was broken and clients were complaining). I went with an approach of being explicit about new layers, with classes like ComponentV2. This is an often-maligned approach, but I found it works quite well.

Basically, all new code gets written against V2, old code gets slowly migrated as requirements or opportunity allow, and sooner or later you hit a point where only a few places still refer to V1, so you can bite the bullet and remove them entirely.

Semantic versioning like this within a project gets beaten out of us early as something source control should handle, but source control doesn't handle long, slow migrations of code to new layers.
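In case it helps anyone trying the same thing, a minimal sketch of the shape (hypothetical names; V1 stays frozen, all new call sites target V2, and V1 is deleted once nothing references it):

    class ReportBuilder:
        """V1: frozen legacy implementation, left untouched during migration."""
        def build(self, data):
            return "\n".join(str(row) for row in data)

    class ReportBuilderV2:
        """V2: all new code is written against this; old call sites migrate over time."""
        def build(self, data):
            # New, cleaner implementation (whatever "better" means for your project).
            return "\n".join(repr(row) for row in data)

    # During migration, call sites move one at a time:
    #   old:  ReportBuilder().build(rows)
    #   new:  ReportBuilderV2().build(rows)
    # Once nothing references ReportBuilder, delete it (and optionally rename V2).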


That article is a classic worth rereading. In my view, the lava layer anti-pattern is a lack of architecture. A situation like this could be helped by assigning a single role to make architectural decisions and by maintaining documents describing them.

Another aspect: despite good intentions, an urge to use the newest techniques and a disregard for the bigger picture signal a lack of seniority. Everyone loves to use the latest tech, but it takes courage, experience and confidence to slowly improve a big project without reaching for the "latest and greatest". You need someone really experienced in charge to do that.


> TL:DR Successive, well intentioned, changes to architecture and technology throughout the lifetime of an application can lead to a fragmented and hard to maintain code base. Sometimes it is better to favour consistent legacy technology over fragmentation.

Nice idea in theory, sometimes impossible in practice.

A few years ago I came onto a project that had a very clearly defined separation of concerns, with a business layer, a data access layer, a presentation layer, and Entity Framework. This was resulting in a number of SQL queries that took over a minute to run and caused web pages to time out.

I ended up cutting right through the layers, bypassing Entity Framework altogether and replacing it with hand-crafted SQL. This ended up cutting down the query time from six minutes to three seconds.


Abstractions and hard-interfaces rarely result in increased efficiency.

Breaking through the barriers and merging layers often allows a more efficient solution, in the same way that denormalisation increases performance by ignoring the "rules".


Well, in the example I've just given, they reduced query times from six minutes to three seconds. If that isn't increased efficiency, then I don't know what is.

The fact is that sometimes you have to ignore the "rules," because the "rules" were designed to serve a purpose that does not apply in your particular case, or perhaps never even applied at all in the first place.

The problem with trying to separate your business layer from your data access layer is that it's often difficult, if not impossible, to identify which concerns go into which layer. Take paging and sorting, for example. If you treat them as a business concern, you end up with your database returning more data than necessary, and your business layer ends up doing work that could have been handled far more efficiently by the database itself. On the other hand, if you treat them as a data access concern, you end up being unable to test them without hitting the database.
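For what it's worth, a minimal sketch of the second option (a hypothetical repository function; paging and sorting are pushed down into the query so the database only ever returns one page, but you need a real database to exercise it):

    import sqlite3

    def fetch_page(conn, page, page_size, sort_column="name"):
        # Whitelist sortable columns so user input never reaches the SQL directly.
        assert sort_column in {"name", "created_at"}
        offset = (page - 1) * page_size
        return conn.execute(
            f"SELECT id, name FROM customers ORDER BY {sort_column} LIMIT ? OFFSET ?",
            (page_size, offset),
        ).fetchall()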

You need to realise that software development always involves trade-offs. Blindly sticking to the "rules" is cargo cult, and it never achieves the end results that it is supposed to.


(I was agreeing with you, not down-voting you).


My apologies :)

Incidentally I wrote a whole series of blog posts a while ago where I cast a critical eye over the whole n-tier/3-layer architecture and explained why it isn't all that it's made out to be.

https://jamesmckay.net/category/n-tier-deconstructed/


Man... I read your blog. It's all true. I find it so hard to explain this to junior devs. They read a lot of best practices and take them as a religion. Lots of unneeded code gets created.

Anyway, I would like more skeptical devs.


Not really a software anti-pattern, as it does not result in any particular software mechanism.

If anything, this is a development-process anti-pattern (not even software-specific). It's also extremely obvious and non-specific, so it's doubtful that it's worth naming as an anti-pattern.


If it's so extremely obvious, why does it happen time and time again? I think I've spent all of my time arguing against this process. It's bad. It's better to improve on a bad design than to make a bad design worse by adding another design on top of it. But that's hard to notice.


Because it's extremely difficult to do something about it.

It's normal for people to change what they're working on; this is inevitable.


There is some benefit to naming common problems so we can discuss them, but this name strikes me as too clever and not immediately clear enough. "Bit rot" is a somewhat similar phenomenon with a much better name.


Disagree with the comments below the blog post. A Dilbert comic dated 2013 was not yet "classic" in 2014, and arguably still isn't. Classic Dilbert is 1994-ish vintage.


Really, classic Dilbert is Scott Adams' blog posts about how Donald Trump is using hypnosis to control the electorate. What happened to that dude!?


Whatever went wrong with his brain, it happened a long time before Trump ran for president.

Scott Adams Poses As His Own Fan On Message Boards To Defend Himself: http://comicsalliance.com/scott-adams-plannedchaos-sockpuppe...

In April 2011, Scott Adams, creator of Dilbert, created an anonymous account at Metafilter, then proceeded to vigorously & furiously praise himself, and insult other commenters. It wasn't the first time he'd done this, but it was the first time he got caught. http://mefiwiki.com/wiki/Scott_Adams,_plannedchaos

Dilbert creator outed for using sock puppets on Metafilter and Reddit to talk himself up (he is also plannedchaos on reddit) https://www.reddit.com/r/comics/comments/gqzgx/dilbert_creat...

As far as Adams' ego goes … he has a certified genius I.Q., and that's hard to hide. -plannedchaos^H^H^H^H^H^H^H^H^H^H^H^HScott Adams https://rationalwiki.org/wiki/Scott_Adams


Wow. I was just sorta giggling quietly at what seemed to be a mild case of blogs-about-weird-things. I had no idea how deep the rabbit hole went



