Improved code caching (v8project.blogspot.com)
168 points by ingve on April 24, 2018 | 27 comments



This feature is really cool; however, the ratio of parse-time and compile-time improvement is not always meaningful to end users. I would be curious to see the absolute & relative load-time gains.

When I was benchmarking Firefox on the same feature [1] back in November 2017, I noticed that reddit loads so little JavaScript that any improvement on it barely matters.

Also, I wish the benchmark website domains weren't truncated to things like "http://www.", so that sites such as "http://reddit." (for http://reddit.musicplayer.io) could be better distinguished from "http://www.reddit." (for http://www.reddit.com).

[1] https://blog.mozilla.org/javascript/2017/12/12/javascript-st...


Websites are built to fit within the existing constraints of the web platform. It's important to look beyond what sites are doing today and instead look at what they cannot do. It's basically this story [1], applied to web performance: profiling and optimizing the website designs that 'survive' and make it to production means failing to observe the designs and sites that never ship because of performance issues, so you end up optimizing only for the cases where the design survived.

In general, parse time and compile time are meaningful enough that large websites spend a considerable amount of effort and energy playing 'code golf' to keep JS size down; otherwise engagement suffers.

[1] https://medium.com/@penguinpress/an-excerpt-from-how-not-to-...


They mention near the end of the article that it improved overall page load (time to interactive) on Android by 1-2%.


Fairly small, but any improvement is obviously welcome.


1-2% on a large scale is pretty significant. If each request takes, say, a second on average and is then reduced by 1%, that'd be 990ms: 10ms saved per request.

Let's say that on average, 5 uncached pages are opened per day per Chrome-mobile user. I don't feel like looking up the total number of Chrome-mobile users, so let's estimate it at 500 million.

5 page loads * 500,000,000 users * 10 ms = 25,000,000,000 ms saved per day; that's 289 days of page loading saved on a global scale, every day.
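
For anyone who wants to sanity-check that arithmetic, a quick sketch (all inputs are the rough guesses above):

    // Back-of-envelope check; all inputs are rough guesses from above.
    const pageLoadsPerDay = 5;
    const users = 500_000_000;
    const savedMsPerLoad = 10; // 1% of a 1-second load
    const totalMs = pageLoadsPerDay * users * savedMsPerLoad;
    console.log(totalMs);                         // 25000000000 ms
    console.log(totalMs / (1000 * 60 * 60 * 24)); // ~289 days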

I don't personally think measuring the improvement in terms of time is very interesting; it's hard to interpret. But each ms of loading consumes a certain amount of battery, so the electricity saved daily on a global scale by a 1-2% improvement is fairly significant.


I'd argue that as long as it doesn't increase CPU load, that 1-2% is well worth it on mobile. It's potentially 2% of battery saved.


Only if your phone is spending all its time parsing JavaScript, and maybe 1% (being generous) of its CPU time is spent doing that for a JS-heavy page. So 1% of 1%. Really not much at all.


Facebook.com, LinkedIn.com, and Google Sheets ship ~1 MB of compressed JS for their main pages, which decompresses to 5-10 MB. So JS parsing ends up taking hundreds of milliseconds on initial load.

And of course, people want to build even richer experiences (live streaming, 360 media, etc) so there is a lot of appetite for loading more code in the same amount of time.


That's -60% parse time and -50% compilation time for Discourse[1].

I'm amazed by the changes the v8 team is pushing.

[1] https://github.com/discourse/discourse


Everything is great, except that since the introduction of Ignition in Chrome 59 (2017-06-05) and the removal of Full-codegen from V8, I have been recording a terrible regression on bellard.org/jslinux. Compared to Chrome 58, each newer version is 3-4x slower.


Have you reported these performance regressions on their bug tracker? [1] I can't find anything. Are you just measuring the time the kernel takes to boot, or something else?

In my experience, they have been quite responsive to quality reports. Like Mozilla, they have many tools to assist QA.[2]

[1] https://bugs.chromium.org/p/chromium/issues/list

[2] https://www.chromium.org/developers/bisect-builds-py


Initially it seemed to be a temporary regression; later, it looked like it might be related to the Meltdown/Spectre mitigations. Currently it is well known to the developers, and I am patiently waiting for an improvement. https://bugs.chromium.org/p/chromium/issues/detail?id=827497...


From your report on the bug tracker:

> This is a non-regression issue as it is observed from M60 old builds.

No mention of the regression described here. If this is an issue you care about, I suggest you check out the bisect-builds-py tool I linked to.


It'd be interesting to revisit that project with wasm rather than asm.js.


I love that project! BTW, how do other browsers fare?


At SEO4Ajax, it looks like we're getting about a 5% performance gain when scraping pages (we use Chrome 68 as I write this message).


Can I use a timing attack to determine whether my script has been seen before, as a fingerprinting measurement? Meaning, is there a way via timing checks to determine whether this is a cache hit, which could tell me something about the user? Or is it per-domain, and effectively like storing a bool in local storage?


You were already able to do this by loading any other kind of cached resource.


While true, I was under the impression that there wasn't a cross-domain cache that wasn't opt-in. Again, though, maybe this is per-domain, so it's moot.


Simple cross-domain <IMG> tags can have their load time measured.
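
Roughly like this (a sketch; a real fingerprinting probe would sample repeatedly and calibrate the threshold rather than hard-code one):

    // Sketch: time a cross-origin image load to guess at cache state.
    // The 50ms cutoff is an arbitrary placeholder, not a calibrated value.
    function probe(url) {
      return new Promise((resolve) => {
        const img = new Image();
        const start = performance.now();
        img.onload = img.onerror = () => {
          const elapsed = performance.now() - start;
          resolve({ elapsed, likelyCached: elapsed < 50 });
        };
        img.src = url;
      });
    }

    probe('https://example.com/logo.png').then(console.log);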


But what is actually being cached?


V8 engineer here: basically, a serialized walk through the object tree, starting from the top-level script. Predominantly, this means bytecode, scoping metadata, and strings.
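
If you want to poke at this cache data yourself, Node exposes the same V8 machinery through the vm module (a minimal sketch; createCachedData() needs Node 10.6+):

    // Minimal sketch of V8 code caching via Node's vm module.
    const vm = require('vm');

    const source = 'function hello() { return "hi"; } hello();';

    // First compile: serialize V8's compiled artifacts (bytecode etc.).
    const script = new vm.Script(source);
    const cache = script.createCachedData(); // Buffer of V8 cache data

    // Later compile: hand the cache back to skip parse/compile work.
    const reused = new vm.Script(source, { cachedData: cache });
    console.log(reused.cachedDataRejected);  // false if the cache was accepted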


Is x86/ARM binary code ever cached? Or is that not feasible? Seems like that could save a lot of time?


Good question: unfortunately not. We only really generate machine code for hot, optimized functions, using TurboFan (all the other machine code, like builtins and the interpreter, is loaded from the "snapshot"). This code is pretty execution-specific, and will include things like constants, raw pointers, and logic specific to the hidden-class transition trees. Relocating this when serializing/deserializing a cache would be expensive, and we'd probably have to also serialize half the state of the current VM heap, so it would overall be a net loss.

Additionally, our current caching is only after initial top-level execution, where hopefully[0] you haven't run enough code for it to be hot yet, so there wouldn't be any optimized code to cache anyway.

[0] I say "hopefully" because getting code hot during initial execution would usually mean that execution is stalling the main thread at startup, which isn't great.


I'd like to think so, though wouldn't dynamic typing mean branching is still possible, so there's likely some runtime checking before you can be sure a precompiled bit is valid? All fascinating stuff.


I think you solve that with dynamic function specialization, and perhaps also trace compilation? https://en.wikipedia.org/wiki/Tracing_just-in-time_compilati...


Could someone add a note to the title that this applies to JavaScript on the V8 engine? I thought it was about caching actual executable code in all languages.



