Deep Painterly Harmonization

extralego · on April 12, 2018

Congrats. This is very cool.

This is probably less a critique of your project and maybe more a celebration of cubism: The clock in the Picasso portrait is not "harmonized" in terms of cubist motifs at play in the composition. The contours appear merely dilapidated. Maybe this passes for someone with little concern for painting or art history but it was like a sort thumb for me. Maybe this is a good study reference on the gap between pattern recognition and human perception. Maybe not. I'd be interested in other thoughts.

jerf · on April 12, 2018

Plenty of them fail to stylistically harmonize even if they visually harmonize. Of course, in some cases, it's debateable whether stylistic harmonization is even possible at the human semantic level. The bicyclist on the bridge in the scream painting was quite bad, and the Eiffel Tower in the cityscape was noticeably degraded by the fact the painting didn't have the colors that the artist would probably have actually used, so it's rather bluish.

What's more impressive is that several of them work quite well, and barring anachronism, at the sizes given in the samples you might not know what was added if they didn't point it out. The park bench I found particularly impressive, for instance. I suspect if we increased the resolution it would break down, but at this size it works pretty well.

macawfish · on April 12, 2018

I was very impressed by the one with the large book added next to the person in the grass. The curvature of the pages changed to be more gentle.

extralego · on April 12, 2018

Yeah good point about the resolution. So I wonder what effect lowering the resolution would have?

In general, it’s definitely impressive how well they work.

jerf · on April 12, 2018

I've measured the "realism" of graphics in terms of resolution for many years now, because nowadays we generally have realistic graphics, as long as the resolution is small enough. (Compare with, say, an NES game, where no matter what you do to that image nobody will ever find it realistic.) There's a lot of techniques/artists/etc. I've seen that are perfectly realistic looking at 320x200 (say), but couldn't sustain the illusion of realism at 1920x1080. This way I get a reasonable metric of whether or not graphics are "photorealistic", or in this case, match the style of the painting.

It would be interesting to see them do a higher resolution version, or if they have them. It's possible it would indeed fit right in even so; now that I think about it the microscale of these paintings are probably very stereotypical and the algorithm might be able to reproduce them well. It would be especially interesting to see if it would correctly reproduce brush strokes, which have a lot of context to them. Given that "deep learning" can do things like reproduce the structure of a TV script accurately, it doesn't seem out of the question.

teenbear · on April 12, 2018

I think the gap in this case is not as wide as you think given that it is only looking at this one image. Had the network been trained for one art style in particular it may be able to pick up on that detail similar to if a person had studied many paintings in that style.

dahart · on April 12, 2018

Way cool. I would personally mistake many of the results for a real painting if I didn't have the original in front of me.

Getting the colors right is half the problem, and I guess the histogram loss function I saw mentioned at the bottom of the page does that. A couple of the results had strange looking color transformations though - the little girl whose red shirt turns blue, even though the bed has some red in it, and the red rose that turns yellow even though there are orange flowers in the background. So it's not always choosing colors that are the closest to the source while being available in the target's palette. Anyone know why the colors sometimes go so far of course?

Also, this is one of the first style transfer papers I've seen that has a pretty obvious built-in and seemingly plausible business idea. I'm sure poster stores in malls everywhere could sell versions of your favorite painting or poster with your or your own face or something else of your choice added to the composition. It's like the new version of the picture board painting with face cutouts, but way better.

azinman2 · on April 12, 2018

Mr Bean as Mona Lisa or Einstein on the $10 bill are perfect examples of that.

sagacity · on April 12, 2018

With these examples it always feels like a very good usecase would be automatic rotoscoping. A movie like "A Scanner Darkly" looks like it could've used these techniques, instead of having all the original footage redrawn manually.

RobLach · on April 12, 2018

That’s a better case for typical style transfer than this.

pizza · on April 12, 2018

Looks like someone did just that!

https://www.youtube.com/watch?v=Rw0hZ_-tztk

akavel · on April 12, 2018

Why would they need/want to add the Disclaimer? ("This software is published for academic and non-commercial use only.") By fear of some patent violation? Or do they plan for commercial use (e.g. licensing to Photoshop) and this way they keep the license non-OSS (a.k.a. the "share source" euphemism https://en.wikipedia.org/wiki/Shared_source)? Is it some kind of a well known legalese in academic circles?

samfriedman · on April 12, 2018

Seeing as two of the authors are from Adobe Research, I'd imagine this IP will be used for a Photoshop filter in the future.

johndough · on April 12, 2018

They use VGG-19 pretrained weights which are derived from ImageNet which consists of mostly copyrighted images. It is currently untested whether this is a legal problem or not.

akavel · on April 15, 2018

Wow, ok, this is seriously insightful and mindblowing for me! I'd never think of such a danger vector, but with your hint I can now totally understand the reason for caution. However painful it feels. Thanks for the reply! :)

adrianN · on April 12, 2018

Maybe formally their institute owns all IP they produce and they can't make it available unless it's for academic use?

eXpl0it3r · on April 12, 2018

Maybe they simply don't have the rights to the whole source code, but were allowed to make it public if they added that disclaimer. Usually the university will have the ownership of code produced in academia and if you find a commercial model based on their research code, they most likely want some money from that.

failedartifact · on April 12, 2018

Is the disclaimer clearly just saying commercial use is prohibited?

midgetjones · on April 12, 2018

Great examples (after I realised they weren't all going to be McDonald's ads)

isp · on April 12, 2018

My favourite: adding a Star Destroyer to A Starry Night. ("What should we add to A Starry Night?" "What else begins with 'A Star'... aha!")

theon144 · on April 12, 2018

No kidding! After the first couple I started only looking at the final image, trying to guess the inserted object.

edu115 · on April 12, 2018

These are the same authors from this https://news.ycombinator.com/item?id=13958366 which was posted a year ago. Really interesting work they are doing.

bguberfain · on April 12, 2018

If a prize for best examples exists, this would win the first place!

isp · on April 12, 2018

Paper: https://arxiv.org/abs/1804.03189

JorgeGT · on April 12, 2018

Knowing that mankind's scientific record now contains a Gioconda with the face of Benedict Cumberbatch has very much brightened my day.

alephnan · on April 12, 2018

How long until it becomes cannibalized into a frivolous Snapchat selfie filter?

IshKebab · on April 12, 2018

As opposed to these totally not frivolous examples?

sp332 · on April 12, 2018

The easy use cases come first.

kalal · on April 12, 2018

Consider for example a picture of a street and a rendered car overlaying it. It would be interesting to see how well the technology bakes in the car so that it matches the environment. This would go in direction of realistic rendering.

spectaclepiece · on April 12, 2018

Anybody have any idea why gen_all.py is python and filt_cnn_artifact.m is written in Matlab? The latter seems easy enough to write in python as well.

Is there something about what this file does that is easier accomplished in Matlab or is it just two different people preferring different languages?

amelius · on April 12, 2018

Does this contain pretrained data? Is there a possibility to train it yourself (e.g. to allow a broader set of styles)?

judah · on April 12, 2018

This is really amazing. Very impressive. Would love to see this published on the web somewhere to try it out quickly without requiring folks to have the necessary components to build it.

amelius · on April 12, 2018

It would be nice to have some more controls.

For example, often the color of the inserted object changes radically, which might not be what was intended.

thanatropism · on April 12, 2018

The first example that's really abstract is awesome.

I'd love to see more things like this with Rothko, Pollock, etc.

Nikita_Sadkov · on April 12, 2018

Guess it could be used to create whole movies or video games in that painter's style.

amelius · on April 12, 2018

I guess that would be more difficult, because there has to be coherence between subsequent frames (i.e., it has to avoid wildly changing random effects between frames).

r34 · on April 12, 2018

So I assume no CUDA (wrong grpahic card) = no possibility to play with this tool? :/

azeirah · on April 12, 2018

You could use a server, I guess?

danielvaughn · on April 12, 2018

This is amazing! Wow, well done.

artur_makly · on April 13, 2018

So has anyone implemented a web-interface for it yet?

r34 · on April 12, 2018