Nice looking results, hopefully not too cherry-picked. Every 3D model generation paper posted on HN has people complaining that the meshes are bad, so this kind of research is welcome and necessary for generated 3D assets to be used in actual games.
Weird custom non-commercial license unfortunately. Notes from the GitHub readme:
> It takes about 7GB and 30s to generate a mesh on an A6000 GPU
> trained on meshes with fewer than 800 faces and cannot generate meshes with more than 800 faces
Certainly a lot of scope for this kind of thing. People who do lidar scans or photogrammetry of buildings tend to end up with very large meshes or very large point clouds, which means they need souped-up PCs and expensive software to wrangle them into some usable CAD format.
It's an area where things can be improved a lot imho. I did some work a while back fitting flat planes to point clouds, and ended up with mesh models anywhere from 40x to 100x smaller than the point cloud dataset. See quato.xyz for samples where you can compare the cloud with the mesh produced, and view the 3D model in recent browsers.
My approach had some similarity to Gaussian splats, but using only planar regions: great for buildings made of flat slabs, less so for smooth curves and foliage.
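The core of that kind of plane extraction is RANSAC-style fitting; a minimal sketch (numpy only, illustrative rather than what I actually shipped):

```python
import numpy as np

def ransac_plane(points, n_iters=200, tol=0.02, seed=None):
    """Fit one plane n.x + d = 0 to an (N, 3) point array by RANSAC,
    returning (unit_normal, d, inlier_mask) for the best consensus."""
    rng = np.random.default_rng(seed)
    best = (None, None, np.zeros(len(points), dtype=bool))
    for _ in range(n_iters):
        a, b, c = points[rng.choice(len(points), 3, replace=False)]
        n = np.cross(b - a, c - a)
        norm = np.linalg.norm(n)
        if norm < 1e-12:                      # nearly collinear sample, skip
            continue
        n /= norm
        d = -n @ a
        mask = np.abs(points @ n + d) < tol   # distance-to-plane test
        if mask.sum() > best[2].sum():
            best = (n, d, mask)
    return best
```

In practice you peel planes off one at a time: refit the winning inliers with least squares, remove them from the cloud, and repeat until the remaining consensus sets get too small.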
Applying their MeshAnything algo to fine meshes from photogrammetry scans of buildings would be of great benefit, probably getting those meshes down to a size where they can be shared as 3D WebGL/three.js pages.
Even deciding on triangle points to efficiently tessellate / cover a planar region with holes etc. is basically a knapsack problem, which heuristics, Monte Carlo methods, and ML can improve upon.
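For reference, the classic greedy heuristic here is ear clipping; a minimal sketch for a simple counter-clockwise polygon (holes are usually handled first by cutting bridge edges to the outer boundary, which this omits):

```python
def ear_clip(poly):
    """Triangulate a simple CCW polygon [(x, y), ...] by greedily
    cutting off 'ears' (convex corners whose triangle contains no
    other vertex). Returns a list of index triples."""
    def cross(o, a, b):
        return (a[0] - o[0]) * (b[1] - o[1]) - (a[1] - o[1]) * (b[0] - o[0])

    def inside(p, a, b, c):
        return cross(a, b, p) >= 0 and cross(b, c, p) >= 0 and cross(c, a, p) >= 0

    idx = list(range(len(poly)))
    tris = []
    while len(idx) > 3:
        for k in range(len(idx)):
            i, j, l = idx[k - 1], idx[k], idx[(k + 1) % len(idx)]
            a, b, c = poly[i], poly[j], poly[l]
            if cross(a, b, c) <= 0:          # reflex corner: not an ear
                continue
            if any(inside(poly[m], a, b, c) for m in idx if m not in (i, j, l)):
                continue                     # another vertex inside: not an ear
            tris.append((i, j, l))
            idx.pop(k)
            break
        else:
            break                            # degenerate input, give up
    if len(idx) == 3:
        tris.append(tuple(idx))
    return tris
```

The greedy ear choice is exactly where smarter search (Monte Carlo over ear orderings, learned priorities) can improve triangle quality; for a simple polygon the count is always n - 2, but once holes and Steiner points enter, the optimization space opens up.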
If you want to show photogrammetric point clouds of buildings, the Potree data structure and algorithm are pretty good, and if you don't like the library for some reason it's pretty easy to reimplement (potree.org).
You just dump the point cloud into a hierarchical octree, and at the viewer end download the nodes in your frustum, and voila.
There are other approaches but this wins hands down on usability/simplicity.
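The viewer-side selection is only a few lines in spirit; a rough sketch (not Potree's actual code: it also picks the depth cutoff per node from projected screen-space size rather than a fixed level):

```python
from dataclasses import dataclass, field

@dataclass
class OctreeNode:
    aabb: tuple                    # (min_xyz, max_xyz) bounds
    level: int                     # depth in the hierarchy
    url: str                       # point chunk to fetch lazily
    children: list = field(default_factory=list)

def nodes_to_fetch(root, intersects_frustum, max_level):
    """Walk the octree, cull subtrees outside the view frustum, and
    return the nodes whose point chunks the viewer should download."""
    out, stack = [], [root]
    while stack:
        node = stack.pop()
        if not intersects_frustum(node.aabb):
            continue               # whole subtree is off-screen
        out.append(node)
        if node.level < max_level:
            stack.extend(node.children)
    return out
```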
I'm quite familiar with Potree and a big fan, having hacked some of the internals and added features to my own custom version, so people can share annotations and measurements, save to cloud, or export linework without writing code/custom HTML.
I also added code to import E57 cube panoramas.
Still, I think if one can use ML to simplify a point cloud or fine mesh, then the data is much smaller and cleaner, and easier to import into existing CAD tools etc.
Understandable! Thanks for what you've shared. I'm doing academic work on something that could leverage a digital twin, hence my interest.
There are many uses for this tech, particularly in less techy crowds that still make significant use of traditional photogrammetry.
Your solution, if local, could give a significant advantage over other products such as Polycam. Again, if local, you could allow for much bigger scans (wink for those doing architecture, particularly those in the restoration field). Anyhow, hope you get that funding!
3) UVs that are aligned with the natural flow of textures on those components.
4) Repeating textures (although sometimes not) that work with the UVs and combine to create PBR textures. (Getting closer all the time: https://gvecchio.com/stablematerials/)
After the above works, I think people should move on to inferring proper CAD models from an image. Basically infer all the constraints and the various construction steps.
It's much easier and cleaner to subdivide quads to refine shapes when modeling. For example, you can split the quads along an entire edge loop to get a new clean edge for manipulation (e.g. to bevel it). If you try to do the same with triangles, you get a jagged mess.
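A toy illustration on a regular quad patch (real meshes walk the loop through face adjacency, but the principle is the same):

```python
import numpy as np

def insert_loop(grid, row, t=0.5):
    """Insert an edge loop into a regular quad patch.

    grid is an (R, C, 3) array of vertex positions where each 2x2
    block of neighbors bounds one quad. The new loop is a full row
    interpolated between rows `row` and `row + 1`, so every quad it
    crosses splits cleanly into two quads -- no triangles appear.
    """
    new_row = (1.0 - t) * grid[row] + t * grid[row + 1]
    return np.insert(grid, row + 1, new_row, axis=0)
```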
This might be a naive/stupid question, but wouldn't it be relatively easy to automatically merge triangles that lie in the same plane into polygons? (I suppose few triangles from this process would be in the same plane, maybe?)
They are not. Polygons are a terrible representation since unlike triangles they do not cleanly represent a unique planar surface. With more than 3 points you will always have an ambiguity (or several) about which (numerical) plane corresponds to the actual face. For some graphics applications this may or may not matter much, but it is very important for anything using the mesh for physical computation.
Co-planar quads will always subdivide into two coplanar tris. That's the crux of why the modeling workflow works. The GPU is going to turn it into triangles anyway, as long as a few fundamental indexing rules are upheld, so you're mostly getting the best of both worlds here.
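To the coplanarity question above: the test itself is cheap. A sketch with numpy (a real merge would also require the triangles to share an edge and the merged polygon to stay convex):

```python
import numpy as np

def unit_normal(tri):
    """Unit normal of a (3, 3) array of triangle vertices."""
    n = np.cross(tri[1] - tri[0], tri[2] - tri[0])
    return n / np.linalg.norm(n)

def coplanar(tri1, tri2, tol=1e-6):
    """True if both triangles lie in the same plane: parallel normals
    plus every vertex of tri2 on tri1's plane."""
    n1, n2 = unit_normal(tri1), unit_normal(tri2)
    if abs(abs(n1 @ n2) - 1.0) > tol:
        return False
    d1 = n1 @ tri1[0]
    return bool(np.all(np.abs(tri2 @ n1 - d1) < tol))
```

On curved or generated surfaces almost no adjacent pair passes a tight tolerance, which is exactly the catch the parenthetical anticipates.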
I feel like maybe CAD would be easier? You only need to represent form/edges, rather than meet all the requirements that you have for using a model for games/rendering.
I am all in for any development in this domain. Just to give some sense of scale: we recently processed (manually) the point cloud scan of part (<1% of the whole complex) of a working oil refinery. The total volume of the point cloud was 450 GB. Our previous project, of slightly larger scope, was 2.1 TB.
So the scale shown in this paper feels like toys! Not undermining the effort at all. We need to start somewhere anyway.
For the same reason, I feel puzzled looking at industrial scenes in video games. They are like three orders of magnitude simplified compared to a real plant.
Real life castles were designed to withstand a siege, video game castles are designed to give off a castle vibe. Once you've achieved that you stop adding stuff, as anything beyond that just creates problems - you start killing performance, visibility starts to suffer, it's not clear what's interactive and what is decoration, gameplay starts to take a hit as the AI and player start getting stuck in the clutter, etc, etc...
Most people don't care as they don't have deep knowledge of how a castle or a power plant really functions, you only notice oversimplifications in media in the field you work in.
It's also very likely the designers and artists didn't have time to do much research, and the whole thing is based off a Pinterest reference board.
A personal pet-peeve of mine is "movies that feature an airplane flying away and turning left off into the sunset..."
ThEy NeVeR AniMaTe The FlAps!!
It's like you'd have an animated motorcycle scene and they don't turn the handlebars or make the bike lean when going around a corner. Like, the graphics are _soooo_ good but then they make the danged plane turn and immersion breaks (for me).
In the same vein, any time someone plays an instrument that they don't really play, and their hands aren't moving to match the music. Or when the sound for a vehicle doesn't match the actual vehicle type - there was a CGI short film with a motorbike that was clearly a Yamaha MT-01 with its massive V-twin, and it sounded like a 600cc 4-pot rather than a tractor.
> For the same reason, I feel puzzled looking at industrial scenes in video games. They are like three orders of magnitude simplified compared to a real plant.
Because they are games, not oil refinery simulators. They are typically intending to only convey a general sense of “industrial environment” and nothing more.
Do your models of oil refineries include the correct grass and other plant species growing in cracks in the pavement?
That's an excellent point. I do feel compelled to mention the exception of oil refinery simulator games. Maxis (of SimCity, The Sims fame) made SimRefinery way back when.
Yes, if a game is in fact a refinery simulator I would expect it to have an accurate representation of oil refineries. But whatever the latest Call of Duty game is? It's going to be a grey block environment designed for gameplay that then gets covered in industrial props and textures and called a refinery.
> I feel puzzled looking at industrial scenes in video games. They are like three orders of magnitude simplified compared to a real plant.
Really? You don't know why video games don't have 80 billion points and you don't know why a tool made to simplify meshes into video game objects isn't using your 80 billion point lidar scan?
For starters, these are meshes and you're talking about points. If anyone is meshing those points and they have any sense, they are working with "toy"-sized chunks too, so they avoid doing nearest-neighbor calculations on terabytes of data.
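A toy sketch of that chunking idea, using scipy's cKDTree (the cell size and padding are made-up numbers; a real pipeline also streams chunks from disk instead of holding the whole array in RAM):

```python
import numpy as np
from scipy.spatial import cKDTree

def chunked_knn_distance(points, cell=5.0, pad=0.5, k=8):
    """k-th nearest-neighbor distance per point, computed one spatial
    cell at a time so no KD-tree ever sees the whole cloud."""
    keys = np.floor(points[:, :2] / cell).astype(np.int64)
    dists = np.empty(len(points))
    for key in np.unique(keys, axis=0):
        in_cell = np.all(keys == key, axis=1)
        lo, hi = key * cell - pad, (key + 1) * cell + pad
        near = np.all((points[:, :2] >= lo) & (points[:, :2] < hi), axis=1)
        tree = cKDTree(points[near])                 # tree over one padded cell
        d, _ = tree.query(points[in_cell], k=k + 1)  # +1 skips the self-match
        dists[in_cell] = d[:, -1]
    return dists
```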
One group finds a way to automate a job, and then our whole society agrees that the people who previously did that job should be tossed out into the street. But for some reason we blame the first group rather than the second.
It's a funny euphemism, in a dark sort of way. But if there is a domain where AI is not getting humans out of a job anytime soon, I think it's this one. I've read dozens of papers about remeshing, but for all of the research, very few algorithms make it to production pipelines. And those that do, still crash and fail in spectacular ways, even after a decade or more of refining and bug-fixing.
"Our method points to a promising approach for the automatically generation of Artist-Created Meshes, which has the potential to significantly reduce labor costs in the 3D industry, thereby facilitating advancements in industries such as gaming, film, and the metaverse. However, the reduced cost of obtaining 3D artist-created meshes could also lead to potential criminal activities."
That last statement is worded in such a weird way, lol. Funny Chinese->English translation.
"The FBI has issued a warning for potential criminal activity resulting from the automatic generation of low-poly models. The public is advised to minimize outdoors exposure and report any suspicious activity."
MeshAnything generates meshes with hundreds of times fewer faces, significantly improving storage, rendering, and simulation efficiencies, while achieving precision comparable to previous methods.
When working with meshes, what you generally want is quads, not triangles. The reason is that quads form nice closed loops.
Furthermore, you would only allow quads to meet at 3, 4, or 5 edges per vertex. Four edges per vertex is the "normal" case that most of your mesh should have; it gives a regular grid of parabolic (Euclidean) geometry with neutral curvature. Patches of these then meet at vertices with 3 edges to make elliptic geometry with positive curvature, or 5 edges to make hyperbolic geometry with negative curvature.
You can ignore all of this and just randomly connect nearest neighbors to form triangles. But then you still have only geometry, no useful topology, so it's no better than a point cloud. A good topology is necessary for texturing, skinning, animation, etc.
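This is easy to audit programmatically; a minimal sketch, assuming faces are given as lists of vertex indices:

```python
from collections import Counter

def valence_histogram(faces):
    """Map valence -> vertex count for a mesh whose faces are lists of
    vertex indices. Clean quad topology is mostly valence 4, with a
    few 3- and 5-poles absorbing positive and negative curvature."""
    edges = set()
    for face in faces:
        for i in range(len(face)):
            a, b = face[i], face[(i + 1) % len(face)]
            edges.add((min(a, b), max(a, b)))   # undirected, deduplicated
    valence = Counter()
    for a, b in edges:
        valence[a] += 1
        valence[b] += 1
    return Counter(valence.values())
```

For a cube's six quads this returns {3: 8}: all eight corners are 3-poles, which is exactly the positive curvature concentrated there.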
Sure. I (mostly) knew all that. I was specifically asking why you said "they are full of n-gons": my understanding of the terminology seems to be different to yours, in that "n-gons" means "5 or more sides on a face", i.e. not a tri or a quad.
Mate, I really don't know how to help you, but even in the examples in the PDF there are clearly n-gons. In 5 of my 10 tests there were n-gons. There are always starfishes with 5 or more connected verts. If you want to nitpick the wording go ahead, but these meshes are shite.
I wasn't picking a fight or scoring points. This isn't Reddit and I'm a grown adult. I'm trying to understand what you're saying and maybe learn something in the process.
> In 5 of my 10 tests there were n-gons. There are always starfishes with 5 or more connected verts.
Ok. So you are basing your definition on the number of edges that meet at a vertex. My understanding was that the important metric was "number of edges on a given face".
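For what it's worth, the two senses measure different things, and with faces as index lists both are trivial to check in code; the face sense is the one-liner below, while the "starfish" vertex sense is exactly the valence != 4 buckets of the valence_histogram sketch upthread:

```python
def ngon_faces(faces):
    """The 'face' sense of n-gon: any face with more than 4 sides."""
    return [face for face in faces if len(face) > 4]
```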
The topology is decent, but no artist is creating meshes like this. The name feels mismatched. I've seen some better topology generation papers at SIGGRAPH last year which addressed quads better, though I'd need to dig through my archive to find them.
The triangle topologies in this paper don't follow the logical loops that an artist would work with. Generally it's rare for an artist to work directly in triangles rather than quads. But that aside, you'd place the loops in more logical places along the surface.
The face and toilet really stand out to me as examples of meshes that look really off.
Anyway, I think this is a good attempt at a reasonable topology generation, but the tag line is a miss.
Yep, hard to reason with industry people pushing slop on commercial production teams.
Low-poly re-mesh tools have been around for ages (some better than others), but there are good reasons pros still do this step manually all the time. Primarily, "good" is based on _where_ the quads, loops, and unavoidable n-gons end up in the model (or stuff ends up looking retro '90s).
There is also the complex legal side of algorithms not being able to create copyrightable works in some jurisdictions. Talk with your IP lawyer, this area gets messy fast when something famous or trademarked is involved.
That's fair, as someone pretty proficient in 3D modelling I understand your point. However, it also boils down to the scale of the project.
Imagine recreating part of a real-life city, creating a digital twin, for scientific purposes (testing human behaviour in fire hazards, or simply iterating on better park planning and road design for greater perceived safety). There's a lot to be done, and it's difficult to use procedural building methods if your aim is for people to recognize that area.
I'm making such a thing myself, purely academic, but god I wish I could speed things up.
Procedural generation of textures, biomes, and cities is not ML/AI generation... Also, physics simulation of erosion can make landscapes look natural to most people.
The problem is when groups start gleaning styles and artwork from 3rd parties to make something in the same style... they cross an ethical line, and a legal one in some situations (even if the original work is completely isolated from the output.)
Thus, while a stochastic parrot may be able to dodge outright plagiarism, it cannot sidestep copyright laws in some Markets.
I'd rather pay folks for royalty free content like Poly Haven offers to the community. =3
Oh, for sure. Wholeheartedly agree. We are little by little eroding the foundations of an economic system which allows individuals to get recognized and rewarded for their hard work.
I may not have worded things well; I was trying to speak of modern 3D reconstruction methods, such as NeRF or Neuralangelo. I can see good uses for them, as I need to fool the senses reliably (participants will be taken to a VR world... mimicking a real place). But as with many things in this field, the reality is that these methods aren't up to snuff. Still, it would be nice to be able to capture reality for non-commercial purposes.
As for Poly Haven, I haven't donated yet... but I hope to do so soon :)
I tried it on the provided sample "hat", with and without checking "Preprocess with marching cubes" and "Random Sample". Both outputs had holes in the mesh where the original did not.
Calling these meshes "Artist-Created Meshes" is disgusting. I know researchers in this field want the word "artist" to follow the same fate as "computer" thanks to their work, but it's too soon, to say the least. Can we get AI researchers instead? I bet RLHF could make their writing more humble than the current ones.