WebKit on GitHub

tolmasky · on Aug 31, 2022

Funny story: my first task when I joined the original iPhone team was to merge our forked WebKit with master. It was a sort of hazing ritual slash "when else would we do it but when someone new joins?". Anyways, we used a tool called SVK[1] in order to get very primitive "git-like" abilities. It was basically a bunch of scripts that used SVN under the hood. For example, in order to get the "local everything"-style behaviors of git, the very first thing it did was checkout every single version of the repository in question. For WebKit, this meant that the first day was spent leaving the computer alone and letting it download for hours. I made the mistake of having a space somewhere in the path of the target folder, which broke something or other, so I ended up having to do it all over again.

Anyways, I distinctly remember one of the instructions for merging WebKit in our internal wiki being something like "now type `svk merge`, but hit ctrl-c immediately after! You don't want to use the built-in merge, it'll break everything, but this is the only way to get a magic number that you can find stored in [some file] after the merge has started. If it's not there, try `svk merge` again and let it go a little longer than last time." A few hires later (I think possibly a year after) someone set up a git mirror internally to avoid having to do this craziness, which if I remember correctly, was treated with some skepticism. This was 2007, so why would we try some new-fangled git thing when we had svk?

1. https://wiki.c2.com/?SvkVersionControl

evmar · on Aug 31, 2022

We had a similar rotation on Chrome team for merges from WebKit (pre fork), and it was similarly a lot of work and clunky tooling!

A few times in my career (including this one) I have thought, "We are sure going to a lot of effort to maintain a modified copy of that code while also preserving our changes atop it as we sync, and this is exactly the kind of workflow that Git was designed to enable." Like, the Linux kernel dev workflow is all about different maintainers maintaining different branches and merging between them, and that is where Git comes from.

So in a setting other than Chrome I have tried out using Git to try to manage these sorts of situation. I have found in practice many engineers aren't comfortable enough with Git to have it end up helping them out tooling-wise. This is disappointing but also not too unexpected given Git's UI.

js2 · on Sept 1, 2022

I worked at Rockmelt from 2009 till it was acquired by Yahoo. You've never heard of it, but we had a browser built on-top of Chromium that had built-in integration with Facebook, Twitter, Posterous, RSS feeds, everything social of the day:

https://imag.malavida.com/mvimgbig/download-fs/rockmelt-8383...

I was our build/release engineer. One of my jobs was keeping our code rebased on-top of Chromium.

It wasn't terribly painful but I could always tell when a new engineer joined Chromium because they'd inevitably rewrite some major component and there'd go my day porting our code onto the rewrite. (I did more than just resolve merge conflicts. I also took a first pass at updating our code before handing it off to the rest of the team.)

We were using git from the beginning. Chromium was using svn then, but Google had an official git-svn mirror and I worked from that.

It's been a while, but I recall that I switched us from working off the tip-of-trunk to working off of release branches. That became feasible when Chromium switched to its more frequent release schedule (2 weeks I think?).

AdriaanvRossum · on Sept 1, 2022

I remember it having a logo from the earth globe and some cracks in it [1].

Funny how our perspective on a browser and social media changed.

[1] https://is4-ssl.mzstatic.com/image/thumb/Purple30/v4/9e/0c/3...

jyz · on Sept 1, 2022

Oh wow. I remember using that browser and going to your meetups getting swags!

lawik · on Sept 1, 2022

No I've heard of it. It had some hype in the early social and app days.

taberiand · on Aug 31, 2022

I don't understand any developer's that aren't willing to put in the time to learn how to use Git - to me, it's the single greatest tool available to enable productivity and confidence in changing code. There's no shame in using any one of the many GUI interfaces for Git that make the process simple and intuitive but even with the CLI, there are only a small handful of commands that I regularly need to use to do all the work of managing branches, merges, rebases and resets; and a lot of the time, there's more than one way to do any particular operation.

eyelidlessness · on Sept 1, 2022

> I don't understand any developer's that aren't willing to put in the time to learn how to use Git

I can effectively use, I don’t know, probably the about 5% of git that I need to do my job. It could maybe benefit me to learn 5% more? The rest feels like a trivia rabbit hole, I have shit to do, and I already have a lot of difficulty with cognitive load and far too many yaks to shave daily. With that said, I have no strong disagreement with you, but I do want to add a bit of nuance: “learn how to use git” is extraordinarily open-ended, and one of the most challenging parts is to know where to even start. Or continued learning has any practical benefit.

A little less nuanced: I encourage everyone who will listen, even experienced devs, to use a GUI frontend. Not just because GUIs do ~90% of what you’ll normally need in a daily workflow (which is especially good for noobs who learn which things are important to know first). Also because the GUIs generally have really obvious cues for how to unfuck a mistake, which is the most difficult thing for people who aren’t already well adjusted to git. I use a git GUI for almost all of my version control tasks, and I’m much more effective for it.

I’ve learned more how to use the git CLI for an art project than I’ve ever needed to know for normal work processes.

(Self nit: bullshit made up approximately educated guess percentages)

bacon_waffle · on Sept 1, 2022

This (and the reply to js2) gives me the impression that "effectively use 5% of git" means something like "effectively use 5% of the git CLI surface", but to me that's not what "learn how to use git" means.

One way or another, I've wound up becoming the git guy at work, and think I'm on the more proficient side of average. But, for most anything outside of commands I use daily (or can easily find in shell history), I'm off to the git docs or a search engine. Just about always, I know what I need git to do, it's just a matter of finding an incantation to do it.

To me, there are two pieces to learning git. The lower level is about git itself: understanding that commits are snapshots and the diffs are calculated on-the-fly, that cherry-pick is an automated version of diff and patch, rebase is like a way to compose cherry-picks. The other layer is about how the organisation uses git - like how to fork a project on github, push some changes to a branch in your fork, make a PR to propose those changes to upstream, why committing directly to main is a bad idea.

Someone · on Sept 1, 2022

> understanding that commits are snapshots and the diffs are calculated on-the-fly

I never understood why that gets mentioned as essential for understanding git. A version control system has to be able to produce both all versions of a given file and diffs between versions of a file, but how it does that is an implementation detail. There are many options there: full first version with diffs to newer versions, full last version with diffs to older versions, variants of those with full versions inserted every now and then, mixes of forward and backward diffs (as is done in some video formats, if you see each frame as a version of a file), etc.

Of course, there may be performance reasons for choosing an implementation. I could understand statements such as “Because of the existence of merge commits, it’s easier to store full files, rather than diffs, as a merge commit would have diffs with each of its parent that must be kept consistent”, but the “understanding that commits are snapshots” claim isn’t about implantation details, but claimed to be essential to understand git.

bacon_waffle · on Sept 1, 2022

That's a good question, and I don't have a straightforward answer.

I think git sits in a sort of uncanny valley, where it's possible to go a long way treating the tool as a black box (and you're totally right - snapshots vs diffs is an implementation detail), but in practice, to really "get it" it's necessary to understand a bit about what is going on under the hood. The problem space that git addresses is inherently difficult, in contrast git internals are quite straightforward.

So, a person wanting to learn git has two options: they can keep the hood closed, study the manual, and fret over all the complicated looking knobs and switches. Or, they can have a look under the hood, realise that there really isn't much to it. Most of those controls just aren't relevant to do what they need to do at the moment, and when a problem arises they have a much better idea of where to look for solutions.

In my experience, being a well-rounded software engineer (for instance able to collaborate with a team at work, and make the occasional PR to random outside projects) requires a certain level of git proficiency. At that level of proficiency, the black-box approach seems to have a much steeper learning curve than the look-under-the-hood approach.

At a less philosophical level: git is a collaboration tool, and people get hopelessly confused about this stuff when they try to collaborate without sharing a common language. "How do I email a commit?" is the sort of question I get asked occasionally.

nicoburns · on Sept 1, 2022

I think a key thing that may be trying to be communicated here is that in guy, commits are primitive, and not branches.

js2 · on Sept 1, 2022

> “learn how to use git” is extraordinarily open-ended, and one of the most challenging parts is to know where to even start.

It's not really. It seems like it is because it has a baroque CLI with a ton of commands and those commands all have a ton of switches and the same commands can do lots of different things.

But under that atrocious CLI git is very elegant and simple conceptually. Once you understand the underlying concepts (blobs, trees, commits, refs, index/stage) the rest flows naturally.

Now I've been working with it since its earliest days including contributing to it here and there, but I never really found it very hard to grok.

"Git from the bottom up" is dated but git hasn't changed conceptually since then. "Git for computer scientists" is another good one. But any of the tutorials that start with git's low-level concepts and build from there are where I'd start.

eyelidlessness · on Sept 1, 2022

> It's not really. It seems like it is because it has a baroque CLI with a ton of commands and those commands all have a ton of switches and the same commands can do lots of different things.

eyelidlessness · on Sept 1, 2022

This quote-only reply was meant with kindness, but you made my point far better than I could have made it myself.

js2 · on Sept 1, 2022

Do you want to be right or do you want to win? I was trying to remain sympathetic to your concerns while offering an alternate approach to learning git you might not have considered. I don’t have any trouble with git’s CLI but I understand why people find it frustrating. That is all.

eyelidlessness · on Sept 1, 2022

> Do you want to be right or do you want to win?

Neither! I appreciate your perspective, and wish I’d emphasized that more than I did.

vbezhenar · on Sept 1, 2022

I’ve yet to meet anyone using git command line in their day to day activity. Everyone’s just using IDE. So IMO git CLI is not the issue in most scenarios. You might want to have git guru in your team to resolve some particularly hard situations, but that should be an exceptional situation.

CodeSgt · on Sept 1, 2022

Inversely, I don't think there's a single developer in my work center that routinely uses git anywhere but the CLI. I think CLI git usage may be more common than you believe.

evmar · on Aug 31, 2022

To be clear, I think many developers are comfortable enough with the small handful of commands most regularly use. For the more advanced case of maintaining a fork of a high-velocity codebase like WebKit, it's likely you'll need a deeper understanding of remotes and how to manage complex rebases, especially in the presence of lots of conflicts. And possibly some fancier tools like git-subtree. In particular in WebKit my recollection was it was common to patch something locally and eventually take it upstream, but after upstream's requested modifications the patch would eventually come back around and conflict with itself.

filmgirlcw · on Aug 31, 2022

> I don't understand any developer's that aren't willing to put in the time to learn how to use Git

It’s not like git invented version control or even distributed version control. Stuff predated git and git wasn’t even the only solution that was coded together when BitKeeper changed license terms (personally, I wish Mercurial had won because it has a much better interface, but what won won and I’m happy enough with git to not really care).

Putting all that aside, systems and codebases have their own workflows for a reason. I’m sure the reason WebKit was on SVN for so long wasn’t because the committers weren’t willing to use git (we’ve seen in this comment thread revelations by former Apple engineers who admit that they maintained a shadow git-based codebase internally), I’m sure almost everyone involved uses git. But for whatever reason, there were blockers to migrating (and some of those are explained in the WebKit blog).

Now, as an outsider, I might think that waiting this long to migrate to git, a solid decade after it made sense to do so, is odd. But I don’t have the context. I don’t know the reasons why, and for what it’s worth, the git-svn mirrors seemed to be working well for the people working on the project.

WordPress, a much more active open source project, at least in terms of outside contributors, is also still on SVN. Like WebKit, most contributors work on the git mirrors rather than using SVN. As an outsider, I can also think that it’s ridiculous for that project to still be on SVN, but again, I don’t know the context. I don’t know the blockers, I don’t know the workflow considerations.

But I do feel confident that none of these decisions (or lack of decisions) were made because developers weren’t willing to learn git.

brundolf · on Sept 1, 2022

The problem is that you spend 95% of the time using it on the golden path; add, commit, push, maybe click a merge button somewhere. At least in my experience

So it can be hard to learn the hard parts by doing, because by the time you need to do them again, you've forgotten what you learned last time

rtpg · on Sept 1, 2022

I feel like you do need a bit of rigor, especially because rebases and the like often don't do what you intuitively want, leading to merge conflicts over and over again when trying to clean things up.

Even small stuff like not knowing to enable rerere means that rebases have incidental complexity compared to, for example, outright recreating commits in another spot.

I disagree with most people about mercurial being "better" (I love my staging area), but doing stuff with git in practice can be super duper fiddly. I've found it's much nicer to do the right thing with stuff like magit though. I bet there is a great UI that is yet to be built for most people that makes it easier to do stuff like "updating old commits" or other things that you end up having to do through rebases.

tambourine_man · on Sept 1, 2022

I use the bare minimum of it and am not happy. I’m sure it’s a great fit for the Linux kernel, but for a sole developer or small team, not so much.

I’d rather just have several independent file system snapshots of a particular folder + logs. I’d like to rewrite history easily. But that ship has long sailed.

saagarjha · on Aug 31, 2022

> I don't understand any developer's that aren't willing to put in the time to learn how to use Git

Other version control systems exist. Why learn Git if you’ve already got one that works?

acdha · on Aug 31, 2022

In this case, was it really working? It sounds like they had enough friction to make considering alternatives reasonable.

In general, I agree with your point but I think there's a very pragmatic argument that the open source world heavily converging on Git means it's worth knowing even if it's not your primary tool. In the 2000s that was spread out more with CVS, SVN, Mercurial, etc. also being good candidates depending on what communities you worked in but it's been quite a while since I've seen even Mercurial being used in the wild.

solarkraft · on Aug 31, 2022

I'm totally willing to learn the magic of advanced merging, but most "tell me more about Git" talks/articles rather want to tell me more about its general internal structure, which I find very far removed from actually using it.

So what's the best resource for learning more about using Git?

LegionMammal978 · on Sept 1, 2022

Personally, I found https://learngitbranching.js.org/ surprisingly helpful for getting a mental model of Git usage, even though I had already been using Git for a couple years. As always with learning resources, YMMV.

rascul · on Aug 31, 2022

https://git-scm.com/book/en/v2

dekhn · on Sept 1, 2022

... and that's how I ended up with a detached head.

girvo · on Sept 1, 2022

The amount of developers who don’t know about “bisect” alone makes me sad. Such a powerful tool especially when you’ve got a good replication of a bug in some kind of test-like thing you can run.

danudey · on Sept 1, 2022

We've been working on a migration from SVN to Git (since 2019, still going strong!) and, since our workflows are all built around SVN assumptions and SVN workflows, it has served only to make everything more complicated and more finicky than it would have been.

I'm looking forward to doing more modernizing of things over time, but for now most of our work is trying to map SVN semantics and structures onto Git, and dealing with weird tooling issues.

Case in point, the Git SCM branch source that the Jenkins multibranch plugin uses; all we need to filter on is branch name, which you can get from `git ls-remote <remote>`, but because of the way it's designed it actually clones the entire repository (between 4 and 12 GB) and then checks the branch list. Nightmareish.

Still, a lot of stuff works a lot better, and now we have more and more developers and teams testing out or using Gitlab's features, like merge requests, and more and more projects trying to modernize their approach with CI builds and the like. It's a very exciting time.

ridiculous_fish · on Sept 1, 2022

Fun story indeed!

I worked in the Cocoa group when we migrated from cvs to git. But it wasn't exactly cvs, it was ocvs, which was an ancient forked version of cvs that handled certain directories (nibs) as tar files.

There was no direct importer from cvs to git, so we had to go through Subversion as an intermediate. This was Cocoa so its history was very long, but the earliest history turned out to be mangled, so some poor fellow was tasked with learning how to repair CVS history, only so it could be migrated to git.

Once we got to git our lives were so much better. I'm glad we skipped svn.

ocvs man page for any masochists reading this:

https://opensource.apple.com/source/cvs_wrapped/cvs_wrapped-...

justinator · on Sept 1, 2022

I'm curious why it wouldn't make sense to just think, "Before THIS DATE we just don't care about history anymore".

I look at my main project and I'm fairly happy to forget about 90% of the branches I have hanging around and half the lifetime of coding it, and it's just 100k LOC.

(except I haven't. Why haven't I?)

saagarjha · on Sept 1, 2022

It’s generally useful to have history around when you’re digging through a codebase for bugs or curiosity.

justinator · on Sept 1, 2022

But where do you draw the line when migrating to a superior source control system isn't possible because of your current source control system?

If curiosity is what you're wanting to allow, keep the current source control system on ice, and move everything to the superior one at its current state + a few releases back. Save a decade or two.

It seems to me a version (sic!) of technical debt otherwise.

gsnedders · on Sept 1, 2022

I think it's especially valuable to try and be complete when doing a migration from a centralised VCS to a decentralised one; once you manage to migrate the repository once, you can then avoid any necessity to maintain servers for it.

In extreme, you could just distribute a tarball of the decentralised VCS, hence all you need is the ability to distribute a tarball, which is a lower bar than maintaining servers for it indefinitely.

mrpippy · on Sept 1, 2022

Did that cvs history date back to NeXT? Or was there some initial Rhapsody import?

LordDragonfang · on Sept 1, 2022

As a (relatively) young dev, I always find it wild whenever I'm reminded that git is such a young tool, because it feels like it should be contemporaries with vim, perl, or even emacs with how fundamental it is to modern software development. Hell, I remember when python was the fancy new kid on the block, and even python2 is a full five years older than git.

scambier · on Sept 1, 2022

I started working in 2012, and my colleagues used usb keys to pass projects around. I was a bit... surprised.

frou_dh · on Sept 1, 2022

The thing that continues to feel strange to me is that Python is actually older than Linux.

lostgame · on Sept 1, 2022

That someone who worked on the iPhone Team responded here - much less with such a detailed reply - is the reason Hacker News is one of my internet GOATs.

Thank you - first for your contribution to the product; and secondly, thank you for opening up a window into the glass wall that is Apple.

emmelaich · on Aug 31, 2022

You can usually rsync the whole lot at first then and checkout from the copy.

Then fiddle with the SVK config to set upstream to the real origin.

OR have admins do a local checkout on the server for people to rsync, then do other appropriate config fiddles.

danudey · on Aug 31, 2022

If you have that access, you should use svnsync instead. For a large repository, running rsync can take hours just hashing all the individual files on either end to compare, just to eventually update one or two revisions. svnsync is much more 'aware', and so it's much easier to keep up to date if necessary.

emmelaich · on Sept 1, 2022

True. Or just zip/tar it up and then rsync that. To be clear I was talking about a one time thing, not continued updates -- which would be done with SVK

lnxg33k1 · on Sept 1, 2022

It’s not funny at all if you lived with these kind of bullshits

sedeki · on Aug 31, 2022

They mention that they'd ideally want a natural ordering on the commit hashes. Something to do with their zero-tolerence security policy.

What's the background there? Why do they need a natural ordering?

anderskaseorg · on Aug 31, 2022

Git already has an ordering like this built in as ‘git describe’.

https://git-scm.com/docs/git-describe

    $ git describe 593a2a5d0639b4b4f91ff6e6ffb64e72020f8fd8
    v2.34.1-83-g593a2a5d06

This commit is 83 commits after the v2.34.1 tag. Git accepts this identifier anywhere it would accept a commit hash, e.g.:

    $ git log v2.34.1-83-g593a2a5d06
    $ git show v2.34.1-83-g593a2a5d06:branch.c

https://git.kernel.org/pub/scm/git/git.git/commit/?id=v2.34....

cerved · on Aug 31, 2022

they want a global, presumably centralized, order

Too · on Sept 3, 2022

Describe works it’s way backwards to find a tag matching the search pattern. If you are checked out on origin/master and the tags come from the same centralized origin, then you will have a predictable global order.

It’s basically the same thing as rev-list that they do, except more readable, with tighter integration to tags and with the result usable as a commitish.

jamesfinlayson · on Sept 1, 2022

Neat - I periodically see those incremental ids show up in places but didn't realised they worked like commit ids. But I'm not surprised.

iam-TJ · on Sept 1, 2022

For all the interesting ways to name an object, see [0]:

  man 7 gitrevisions

and for this particular naming "describeOutput"

[0] https://manpages.debian.org/bullseye/git-man/gitrevisions.7....

tln · on Aug 31, 2022

"zero-tolerance performance regression policy"... no patch can land if it regresses benchmarked performance.

I'm guessing the tooling around this used subversion's increasing commit numbers and it was easier to add a shim to git, than to rewrite or rethink the tooling.

usefulcat · on Aug 31, 2022

> no patch can land if it regresses benchmarked performance

..unless it fixes some important security vulnerability, one hopes..

btown · on Aug 31, 2022

Via https://webkit.org/performance/ :

> If a patch lands that regresses performance according to our benchmarks, then the person responsible must either back the patch out of the tree or drop everything immediately and fix the regression.

I imagine that a security hotfix would lead almost immediately to the second situation (perhaps as soon as the implementor had gotten some sleep!)

mhh__ · on Aug 31, 2022

the trick is to change the benchmark at the same time

xfmpXIe76lF4GfR · on Aug 31, 2022

git literally has built-in tooling for this. It's called bisect (and they literally mention "bisection" in the next sentence).

howinteresting · on Sept 1, 2022

The builtin tooling is insufficient for many purposes, including if your bisect algorithm requires you to run tests across many machines. Many large projects write their own bisect framework because of this.

Too · on Sept 3, 2022

It is intuitively a bit hard to believe, but bisecting in parallel is actually not much faster than serial. In my experience you just save the final one or two steps in the bisection - regardless of how long the range is!

Imagine bisecting builds with parallelism of two, when one of them completes, there is a 50% chance that the build on the other side of the range is now uninteresting. If you are lucky and it is interesting, you’ve only saved yourself half a step in the bisection, because when running in parallel you sliced the range in 3 rather than 2. Adding even more parallelism just makes this effect even worse.

Someone can probably work out the math better than me but you can quickly see that for 2x build power you instantly waste half the results for very marginal gain.

Just comparing the big O should also tell you this, parallelism only buys you O(N) while bisecting is O(log N).

alerighi · on Sept 1, 2022

A security fix most of the times does have a performance impact. What do they do?

gernb · on Sept 1, 2022

> no patch can land if it regresses benchmarked performance.

That's complete BS. Just go search for all the perf regressions in the issue tracker.

kelnos · on Sept 1, 2022

> benchmarked

Perhaps the regressions in the issue tracker are for things without benchmarks.

nemetroid · on Aug 31, 2022

Zero-tolerance performance policy. You can find the policy [1] by searching the web. And the hashes don't have to be ordered.

1: https://webkit.org/performance/

0xfffafaCrash · on Sept 1, 2022

I wonder if this policy, or the general mentality that produced it, is why webkit is so underdeveloped and widely reviled by devs compared to alternatives.

Sounds like one can push in a security bug or performance increase that is fast because it is frankly completely broken, but wouldn’t be allowed to reverse the decision at least not without finding some alternative unrelated performance increase. Features or fixes that may obviously be worth an associated dip in performance can generally not be implemented.

Under these conditions if I knew of ways to improve performance without adverse effect, as a developer I would sit on them instead of applying them because I would need a stockpile of reserves to apply in case of emergency to account for unrelated performance degrading changes that absolutely need to be made. I would also work to make sure the benchmarks were lousy and irrelevant.

Policies like this can’t be written by people with any semblance of sense. Singular mindedness on one metric and extreme policies like this can kill development and subvert goals just as lack of discipline can. Everything is a balance. Weigh performance metrics heavily by all means, but absolutism leads to the absurd.

I guess as long as Apple holds its monopoly on keeping iOS device browsers exclusively on Webkit, it will stick around though.

Sadly having had years of an inside perspective on the competence level or lack thereof of decision makers in the non-revenue generating parts of their software business, can’t say this policy surprises me.

saagarjha · on Sept 1, 2022

You wonder wrong.

bjackman · on Sept 1, 2022

Seems like the other commenters understand this already but it took me a while to figure it out so for anyone else that's confused: IIUC by "natural ordering" they mean you can tell the ordering just by looking at the IDs.

Funnily enough I have the opposite desire - I've worked in a VC system with "natural ordering" and it once led to an incident, where I visually compared two version IDs and said "yep this release has the bug fix". Turns out this is hard to do accurately for big numbers and I was wrong. I put a big warning on our ops documentation saying "never compare version IDs visually" with a link to the postmortem!

Trufa · on Aug 31, 2022

I have literally 0 inside knowledge but from the article it seems to be a more human visual thing than a software problem, something like this was working in 12 and broken in 13 is a more obvious regression than this was working in aaab131 and broken in ccad53s

LeifCarrotson · on Aug 31, 2022

13 being greater than 12 is not a property that's just for human vision. In Subversion a commit on a branch increments the global commit number.

Git doesn't have a concept of one commit being before or after another once you've branched, or any native mechanism for enforcing global state across branches.

mattkrause · on Aug 31, 2022

Sequential IDs also let you think about ranges: a feature was introduced in 11, broke in 17-23, and worked thereafter.

I used SVN like this in grad school: data files included the SVN $Id$ of the script that generated them. This let you work around bugs and experimental changes. For example, you might hardcode a delay, realize it should be longer, and then eventually decide to let the experimenter adjust it on the fly. This is easy with sequential ids:

   if version < 11:
      delay = 50
   elif 11 <= version < 29: 
      delay = 100
   else
      delay = params.delay

Using git hashes, you'd need to maintain an exhaustive list of every version ever run, which is even tricker because there isn't a sole source of truth like an SVN repo.

alerighi · on Sept 1, 2022

Git hashes are not for identifying versions but only to identify commit (that may not have a significance on their own, e.g. a developer uses more commits to implement a new feature!). To identify version you should use tags that can be written on the format you like. A tag is just a non mutable (unlike beaches) pointer to a particular commit.

In my company we have the CI server that automatically creates a new tag (sequentially) each time one pushes on the master branch.

cerved · on Aug 31, 2022

I must be missing something because this seems odd, why isn't this code just different in each corresponding version?

mattkrause · on Aug 31, 2022

The SVN-controlled code generated data by controlling hardware and embedded the $Id$ of the controlling script in its output. I would then refer to the version ID later, when loading that data in for analysis. This accounted for any changes to the data-generating code.

For example, we tracked the orientation and direction of objects moving on a screen. One update redefined 0° to be up/north/12:00 instead of the +x direction used before. The code which loaded these files checked the $Id$ value and rotated the directional data so that the entire dataset used the same definition.

xfmpXIe76lF4GfR · on Aug 31, 2022

Git also lets you think about ranges. You just tell it the range and it figures out what commits are in the range. You can also get a sequential number from whatever point you choose with tools like `git describe`.

kelnos · on Sept 1, 2022

I remember when I was migrating projects from svn to git, I was also concerned about the difficulty in telling order of commits at a glance.

Turns out it ultimately doesn't matter, and after nearly 15 years using git, I have not once cared about ordering of commits.

latchkey · on Aug 31, 2022

As someone who was there in the very very early Subversion meetings, I'm surprised it took this long for them to migrate.

olliej · on Aug 31, 2022

The Git dev model is _very_ different from cvs, svn, etc so the trade offs are less obvious.

A lot of the benefit of git to me has always been the local development model, but the git-svn bridge made that largely transparent which I think lowered the pressure to change.

latchkey · on Aug 31, 2022

> The Git dev model is _very_ different from cvs, svn, etc so the trade offs are less obvious.

I think that the difference makes the tradeoffs using anything other than git, more obvious. I even held out myself for a very long time with svn vs. git and once I switched... I kicked myself for not doing so earlier.

But, like they said in the post... they did need a feature, which is core to svn (incrementing changelog ids) and a workaround in git. Minor in the grand scheme of things.

xtracto · on Aug 31, 2022

I remember migrating some codebase from CVS to SVN ... and this was sometime after CVS was adopted instead of "at the end of the day, every dev will copy his change into a floppy disk and give it to the Tech Lead for merging".

This was during the 90s in a software development company in Mexico. Good times!

hnfong · on Sept 1, 2022

> This was during the 90s in a software development company in Mexico.

Version 1.0 of svn was released around 2004. According to wikipedia the project was started around 2000.

(Sidenote: there seems to be cognitive dissonance that svn was released much earlier than git ... but svn was released in 2004, and git in late 2005. There's a less than two years gap in between, yet so many projects had been "stuck" with svn...)

ayewo · on Sept 1, 2022

IIRC, svn was marketed as "CVS done right" [1], which meant devs who were exploring alternatives to CVS didn't have to worry about svn's learning curve.

So, I think inertia may have been a huge part of why adoption of svn was so good compared to git that has a steep learning curve, even today.

1: Of course, Linus Torvalds believes you can never do CVS right https://www.youtube.com/watch?v=4XpnKHJAok8 and there's a transcript: https://gist.github.com/dukeofgaming/2150263

ReptileMan · on Sept 1, 2022

Unpopular opinion - git is just not that good of a source control tool. Amazing as a distributed system, but mediocre in its primary function.

antris · on Sept 1, 2022

Source control is an inherently distributed system. Each developer has their own copy of the code that are distributed across many computers, and they need to communicate between each other. That's exactly what the primary function of the software is.

flohofwoe · on Sept 1, 2022

That's just as true with SVN, (for the local in-progress working copies) but with a single, centralized history. Which is how most devs use git anyway (essentially emulating a centralized versioning system workflow), and in this case the distributed nature of git allowing many alternative histories is unnecessary in the best case, and massively gets in the way in the worst case (because for most projects a single history is actually a feature).

saurik · on Sept 1, 2022

(In a very real sense, the server from Subversion does the same job of the optional server from Operational Transform: it is acting as a sequencer. The next thing we could do is replace that with proof-of-work, and then we could have a fully decentralized system with the properties of Subversion ;P.)

ReptileMan · on Sept 1, 2022

Git is true mesh source control system. You only need one .git folder to survive and you haven't lost anything. My memory is fuzzy but with svn if you lost the central repository you were in a world of hurt.

This is what git does right. What git lacks for me is smoothness in interaction with the user. To use git well you must think like git. Which is kinda annoying when I prefer my tools to think like me.

When I moved from svn to mercurial it was absolutely painless. On my next company git was terrible experience. I am sure that git tools and flexibility are amazing for linux kernel and other projects of the scale. But probably are overkill for smaller stuff when a more friendlier user flow will be nice.

morelisp · on Sept 1, 2022

My memory is that SVN was very popular long before 1.0, I feel like by 2002 it was "beating" CVS for new projects among hobbyists. It was the exemplar for VC in my SE course at university in early 2003.

Lorkki · on Sept 1, 2022

As I remember, the initial versions of Git would have been a chore to use; the intention was for it to be only the "plumbing", with a more friendly front-end layered on top. So it took a good few years to change that, and for it to catch on.

miohtama · on Aug 31, 2022

What's a decade or two in the grand scale :)

bastardoperator · on Aug 31, 2022

If they had floating or dynamic externals and a bunch of permission models, I'm not surprised at all. I'd label this more of a conversion versus a migration.

nimbius · on Aug 31, 2022

github has had more than fifty outages this year alone, and has a rocky history of recourselessly banning users from countries that are sanctioned by the United States. switching to github makes no sense if "The WebKit project is interested in contributions and feedback from developers around the world."

https://www.githubstatus.com/

who made this decision?

maxwell · on Aug 31, 2022

> recourselessly banning users from countries that are sanctioned by the United States

Are you suggesting that Microsoft should intentionally opt not to comply with OFAC sanctions?

Do you know of any non-OFAC sanctioned entities that have made that choice?

Are you aware of any OFAC sanctioned entities that maintain public accounts on any other code sharing sites?

joecool1029 · on Aug 31, 2022

AFAIK Github can provide free access to public repositories even if the users are subject to OFAC sanctions. In some cases they've applied for (and received) exemptions to allow for sales of paid services: https://github.blog/2021-01-05-advancing-developer-freedom-g...

xur17 · on Aug 31, 2022

They also banned developers that worked on Tornado Cash (not just the project, any developers that worked on it), a project that had a deployment put on the OFAC list. It's almost universally agreed to be an unnecessary step by Microsoft.

acdha · on Aug 31, 2022

Did they ban anyone other than the core project developers? There were specific people called out by name in the Dutch press release believed to have personally profited from North Korea's money laundering through Tornado Cash. That's pretty different from “any developers”

I note that Matthew Green's mirror and GitHub account do not appear to have been blocked, which would fit with the idea that there's more to this than just committing code:

https://github.com/tornado-repositories

schmichael · on Aug 31, 2022

> github has had more than fifty outages this year alone

I'm a heavy user of github every day and maybe 1 or 2 of these caused me any disruption whatsoever. Most of the time I think they created productivity boosts as people just focused on what they were working on instead of reacting to Github notifications about issues or PRs or failing tests or whatever.

> a rocky history of recourselessly banning users from countries that are sanctioned by the United States

This is likely a feature for companies, projects, and organizations who have (or want) to adhere to the same strict regulations.

klodolph · on Aug 31, 2022

I think the bigger picture here is the migration to Git, since that lets you keep working during an outage. SVN does not.

ajross · on Aug 31, 2022

In fact the very fact that your source control hosting service can be surpringly[1] unreliable is the best advertisement for git you could imagine.

In fact, if github disappeared from the internet today, all but the largest projects could just set up an ssh-accessible box somewhere and continue work (code review and issue interfaces notwithstanding, of course), probably with 24 hours.

[1] I work in github-cloned repositories almost full time. And sure, I remember a handful of times over the past 4-5 years where it's been down when I wanted to push something. I had no idea it was 50x/year! And that's because "working in a github-cloned repository" doesn't, in fact, require much contact with github itself.

lapinot · on Aug 31, 2022

> I think the bigger picture here is the migration to Git

Is it tho? Why wouldn't they just install git on their server? Now there is not many mainstream successful social hosting for svn. They acknowledge the choice of github is to attract devs. So it's as much about the software as about the type of hosting and web presence.

acdha · on Aug 31, 2022

> Why wouldn't they just install git on their server?

That's easy, and it's about 5% of the functionality which GitHub provides. Even if you're working entirely in private, the tools you'd have to build yourself to do code review, CI/CD, package management, security updates, etc. are a significant amount of work and that's before you get to things like Codespaces.

the_gipsy · on Sept 1, 2022

I doubt webkit would do anything beyond code review on github.

acdha · on Sept 1, 2022

Code review itself is a big deal in terms of the complexity of the UI for managing reviews but I’d also be surprised if they didn’t use anything else. Linting and other static analysis checks, reporting CI results, etc. are quite powerful and less work than setting the equivalent infrastructure up yourself.

the_gipsy · on Sept 1, 2022

Ah yes, they will 100% have a working review process with some other tools, no need to migrate to github's which isn't really flexible anyway.

eyelidlessness · on Sept 1, 2022

> about 5% of the functionality which GitHub provides

Are you sure? I can’t even use “go to file” on GitHub and stay on a selected ref, I can’t bisect and gob help me if I need to rebase before closing a PR. I made a comment elsewhere in this conversation that I think I might use 5% of git functionality. I like GitHub, but if I can’t use even that on their site I’m having a hard time imagining they provide ~20x value over git as underlying functionality.

acdha · on Sept 1, 2022

There are a handful of deep Git features like rebase or bisect which GitHub doesn’t expose but those aren’t things most people use frequently. Git has no equivalent for the things people do use all of the time: the issue tracking system, code review with all sorts of rules and approvals, the CI/CD system, package management for manage languages, not to mention newer features like Codespaces.

That’s a ton of features which cause people to use services like GitHub or GitLab, and it’s not like you’re giving up any of the CLI functionality to do so. My point wasn’t that these services are perfect but rather that there’s way more to it than setting up a Linux box you can push to.

eyelidlessness · on Sept 1, 2022

I don’t disagree with anything you’re saying other than the relative scale of what each provides. Like I said, I like GitHub. I just think it adds less to git than git adds to it. And most of their features are great, but I’d sure rather a nice distributed interface to bisect than an IDE in browser or issue forums (which are useful too!).

saagarjha · on Sept 1, 2022

I believe interactive rebases are my most frequently used git command :)

acdha · on Sept 1, 2022

I commit more frequently, yes, but I will note that it works great in Codespaces if you don't want to run it locally.

therealmarv · on Aug 31, 2022

hm, git is distributed and works offline https://git-scm.com/about/distributed

profmonocle · on Aug 31, 2022

It "works offline" in that you can create commits, view project history, and view every branch while offline. But fetching and pushing are such a common part of an engineer's day-to-day workflow that a poorly-timed outage of your remote repo is very disruptive, especially if you use git for deployment.

acdha · on Aug 31, 2022

This is technically true but the number of GitHub outages which have prevented you from doing that for more than a couple of minutes is pretty low. In comparisons like this, the more important question is not “is GitHub perfect?” but rather “what are you comparing it to?” — internal systems are notorious time-sinks and productivity levels from using GitHub normally are high enough that I think it'd be quite fair to conclude that you're still well ahead of where you'd be even if you have an extended coffee break once or twice a year.

astrange · on Aug 31, 2022

You can fall back to the actual intended git development model, emailing each other patches with `git format-patch` and `git am`.

Not that code reviews of diffs over email are all that great.

qudat · on Aug 31, 2022

Not to mention if you rely on gh actions for ci/cd. I think it makes sense for them to migrate to git and github, but I've been slowly migrating most of my code to sr.ht or self-hosted mirrors. Email patches work pretty well for smaller teams.

encryptluks2 · on Sept 1, 2022

Git is literally an SSH sever so management is not a complex as you think.

baq · on Sept 1, 2022

it's disruptive the first time you have to do it without github; afterwards it should be only a few tweaks in your git repo url configs

ranman · on Aug 31, 2022

FWIW not all of those outages were the core git/web product. A lot of those were GitHub actions or other associated functionality... but yeah it goes down disturbingly often given how much we all depend on it.

toastal · on Sept 1, 2022

As roused as people get about browser monoculture, they should be doing the same about Git forge monoculture and centralization.

jorblumesea · on Aug 31, 2022

Have you ever used SVN?

It's like git, but even more connected to a centralized server.

lapinot · on Aug 31, 2022

I think GP is not talking about git in general, but about choosing a free-tier hosting by an american commercial entity, and not by the project itself or some other umbrella organization.

ReptileMan · on Sept 1, 2022

Are there that many active (and high value generating) developers from the sanctioned countries to be impediment?

Not being on github also has costs.

It's all about the tradeoffs.

kalleboo · on Sept 1, 2022

Don't those sanctions also apply to the largest maintainers/contributors/mergers of WebKit (Apple)

stusmall · on Aug 31, 2022

I don't think WebKit reasonably sees itself as a risk for US sanctions unless they have an open source money laundering feature that no one has told me about.

olliej · on Aug 31, 2022

Awwwww, I remember back in the days of dealing with CVS, where there was so much scripting to try and manage basic stuff we take for granted like creating patches that included new files.

Subversion was so undeniably superior that everyone was super happy and instant. Git took much much longer as the complexity vs. the win was much more debatable to people, so it's interesting to see this finally happening - I will miss linearly increasing revision numbers though.

Glad to see they're keeping with bugzilla though - for whatever reason I find the GitHub issue tracker super annoying. Presumably at least part of that is familiarity and/or change resistance :D

usefulcat · on Aug 31, 2022

I remember when CVS was the new hotness. I was on a team of 3 at the time (90s), and one of our members worked remotely, so the fact that it was actually usable over a dial up connection was a killer feature for us. Also pretty much anything was better than SourceSafe, which is what everybody else in the company was using.

a-dub · on Sept 1, 2022

what was it again?

export CVS_RSH=ssh -z9

sourcesafe was trash. i migrated a pretty massive project off of it (browser, os, several bsps) and like 30% of the revisions were irretrievable.

mrweasel · on Sept 1, 2022

Subversion had it's quirks initially. It didn't work on NFS for instance, because the default backend was Berkeley DB, or some version of it.

Looking back Subversion was a bit of a weird project. It didn't re-evaluate what was wrong with CVS. It was more a "Let's try that again, but fix a few obvious problems". Subversion didn't really contribute with that much overall. We could do branches more simply, but most of us just replaced the cvs command with svn and continued to work as before.

dtgriscom · on Sept 1, 2022

> Subversion was so undeniably superior...

I was fine with CVS, and I'm fine with Git now. But, for some reason, I could never wrap my head around Subversion. Something about revisions-as-a-directory-tree just messed with my neurons.

dwaite · on Sept 1, 2022

> Something about revisions-as-a-directory-tree just messed with my neurons

Do you mean tags and branches?

a-dub · on Aug 31, 2022

> Subversion was so undeniably superior that everyone was super happy and instant. Git took much much longer as the complexity vs. the win was much more debatable to people, so it's interesting to see this finally happening - I will miss linearly increasing revision numbers though.

p4 and then git were easy sells for large projects. while subversion was faster than cvs with its local hidden copy, many operations were still dog slow as they'd scan the whole repository (this was often worked around by creating lots of small repositories with associated wrapper scripts). p4 and git on the other hand were designed to handle large trees with ease. so for something on the scale of an operating system, browser, or both... the difference in productivity was significant. (tens of minutes vs single digit seconds for basic operations)

JohnTHaller · on Aug 31, 2022

With the bugzilla issue tracker, they keep ownership.

olliej · on Aug 31, 2022

What do you mean? (as in ownership of what I guess?)

favorited · on Aug 31, 2022

TIL about `git rev-list --count HEAD`. I've been spelling it `git rev-list HEAD | wc -l` for years.

jamal-kumar · on Aug 31, 2022

Yeah I think that's one of those features that comes in handy when you're developing on Windows with git

favorited · on Aug 31, 2022

Interesting – what makes it particularly handy on Windows?

I've only used it for automating build numbers. The number of commits on the main branch behaves, in practice, close enough to a monotonically increasing counter that it works 99.9% of the time without anyone thinking about it.

klodolph · on Aug 31, 2022

“Handy” because Windows doesn’t have the same set of utilities like wc.

mgdlbp · on Sept 1, 2022

  git rev-list HEAD | measure

formally `Measure-Object -Line`

jamal-kumar · on Sept 1, 2022

I need to get better at powershell but it keeps on annoying me massively with basic stuff you can do in unix shell since the 80s, 90s or 2000s that you still can't do in PS. It needs to reach a sort of usability parity for sure.

One of those things I ran into recently was trying to find an equivalent of the following thing I do a lot of in *nix programming: Finding files of a certain pattern in grep, only list them, and then pipe that through something like awk or perl to find/replace in each file for each of those found files in that given grep. That's super easy in *nix. Powershell still doesn't have such an equivalent. It came of age in a time when everything as text and etc from unix principles weren't considered.

mgdlbp · on Sept 1, 2022

I think the philosophy is that as much as possible is done by composing cmdlets within the PS ecosystem, Select-String being kind of like grep and PS itself being the one language for all scripting. There's a lot that 'awk or perl' could map to; here's a (made-up) example of converting dates in certain files from 'August 1, 2022' to '2022-09-01':

  sls '(ERROR|WARNING)' *.log | %{$_.Path} | %{ (gc $_) -replace '\b\w+ \d\d?, \d{4}\b',{
      [datetime]$d = 0
      [datetime]::TryParseExact($_.Value, 'MMMM d, yyyy', [cultureinfo]::InvariantCulture, 0, [ref]$d)
        ? $d.ToString('yyyy-mm-dd')
        : $_
  } | Out-File $_ }

PS's advantage: leaning on .NET for doing things the right way.

The above has mostly top-level pipes like the Unix equivalent. You could also start with Get-ChildItem and skip the Select-String, instead having a conditional within a ForEach-Object, which certainly can masquerade as awk:

  ls -file|%{gc $_|% -begin{"Counting in $_";$n=0}{if($_-match'\b\d{4}-\d{2}-\d{2}\b'){$n++}}-end{"$n lines"}}

demonstrating that PS one-liners can be just as readable as those of Unix tools!

But, as you've identified, the pain in pipelines with native commands is real. No subshells or named pipes, naturally. All command output is parsed and re-serialized—anything binary, or with linefeeds, or UTF-8 without BOM is, as expected, silently corrupted. At least that's being worked on (https://github.com/PowerShell/PowerShell/issues/1908) -- the hindsight shell manages to have an immense amount of WTF locked in, turned into wontfixes by Microsoft's backwards compatibility. Tour the footgun arsenal: https://github.com/PowerShell/PowerShell/issues/6745

Oh, and the dreadful slowness. But is it worth it, to no longer fear self-pwn by file naming?

jamal-kumar · on Sept 1, 2022

I do like how it's a bit more semantic but find/replace for all instances of files that contain a search term in any .sh is like

  perl -p -i -e 's/term/replacement/g' | $(grep -rl term)

It's unbelievably fast and handy when you have multiple directories to recurse through or or tons of files. Anyone can remember that after using it like twice. Powershell requires like five lines of code just to get that done as you demonstrated. That's not feature parity at all. I also strongly feel it's way more readable than your example.

mgdlbp · on Sept 2, 2022

Oops, the PS ? and : actually need to be on the same line. Whiteboard fail. Likewise there shouldn't be a pipe before grep, right? So,

  perl -pi -e 's/term/replacement/g' $(grep -rl --include=\*.sh term)

Apples-to-apples, the date conversion with perl is,

  perl -pi -e 'use Time::Piece; s/\b(\w+ \d\d?, \d{4})\b/
      eval{Time::Piece->strptime($1, "%B %d, %Y")->strftime("%Y-%m-%d")} or $1/eg' \
    $(grep -ilE '(ERROR|WARNING)' *.sh)

and the simple replace with PS is

  ls *.sh -r -File | %{($_|gc) -replace 'term','replacement' | Out-File $_}

Not Perl terseness, but not 5 lines either (maybe in Visual Basic).

jamal-kumar · on Sept 1, 2022

Yeah that

Some of us are bound to stuff that doesn't exactly work under Linux, for platforms such as commercial video game consoles or business

cerved · on Aug 31, 2022

in git for windows, wc is included

klodolph · on Sept 2, 2022

It makes a lot of sense for some people to install Git into their path but not stuff like wc, since adding everything to your path can interfere with standard Windows utilities. Then you can use Git from PowerShell, but you won’t have wc.

This is not an unusual or bizarre configuration, it’s a very reasonable configuration.

xfmpXIe76lF4GfR · on Aug 31, 2022

Depending on which git distro you installed and what options you chose, sure.

fomine3 · on Sept 1, 2022

Generally Git for Windows includes bash and coreutils, but it's not work on cmd or PowerShell unless it's added to PATH, so it would be useful sometimes.

jamal-kumar · on Sept 1, 2022

Yeah I don't install it from the MSI, takes longer to update

cerved · on Sept 1, 2022

like I said, wc is a part of git for windows

WorldMaker · on Aug 31, 2022

On the automating build numbers side, `git describe` is really handy. By default it's {last annotated tag name}-{commit count since}-g{hash prefix of most recent commit}. (There's an optional --dirty flag to include a marker when the repo isn't clean status. There's options to control which tags are eligible for the first part.) It's basically a full build version identifier from a single command. Generally the only things to do to make it 'semver compliant' is to swap the second to last dash with plus and the last dash with a dot, depending on how you expect to use it with semver. (v1.2.3-18-g123456 => v1.2.3+18.g123456)

Bonus is that almost every git command that takes a "commitish" descriptor (git switch -c, git log, etc) can work with a `git describe` output as "commitish" name for a branch state.

doe88 · on Aug 31, 2022

Is there a reason why it seems there is so little documentation/comments in source files of WebKit? Or maybe I'm missing something/opened the wrong files.

domenicd · on Aug 31, 2022

There's some policy about comments being bad / code should be self documenting.

From what I hear from my colleagues on the Chrome team, one of the first things they were delighted to do after forking Blink was to finally go around and comment the various confusing parts of the codebase. (And, no longer get requests to remove comments when trying to land a PR.)

habosa · on Sept 1, 2022

I've met some very smart and capable programmers who perpetuate this "style". It's so harmful. Comments are good for the writer AND the reader.

doe88 · on Sept 1, 2022

Thanks, for the reply, what such a weird policy. I found it so uncanny at first, that I even thought that they had somehow internally a system where they were able to automatically merge some comments and documentation with their source files.

saagarjha · on Aug 31, 2022

Most WebKit developers are good at documentation, it’s just that they often work on things that their employer would not like being made obvious because it deals with SPI or unreleased products or security vulnerabilities. Commit messages are actually pretty good for the most part except in these situations where a laconic or purposefully misleading message will be used.

mnutt · on Aug 31, 2022

Documentation has been a bit of a challenge in my experience. There are some high-level docs at https://trac.webkit.org/wiki though many are 10-15 years old at this point. My approach has been to look at the commit history for the file to see if the changesets shed any light, and sometimes go to the attached bugzilla link to see if there was any discussion about the change there. Then attach a debugger and step through to try to uncover how the classes relate to one another.

tuankiet65 · on Aug 31, 2022

You aren't opening the wrong file, there isn't much documentation in WebKit besides a few Markdown files. I'm not sure why this is the case.

tibbydudeza · on Aug 31, 2022

What does Apple use for source control ???. I know Google is all custom and MS uses (???) a modified instance of Perforce.

bragr · on Aug 31, 2022

Microsoft maintains the entire Windows source tree in Git now and has made some really interesting contributions to git where it comes to very large projects, though I don't know about the penetration throughout other dev groups. They also own Github now obviously.

modeless · on Aug 31, 2022

I was just looking at Microsoft's git VFS (https://github.com/microsoft/VFSForGit), which is deprecated and now points to Scalar (https://github.com/microsoft/scalar), which is also deprecated I think? What's Microsoft's story with git for large repos now? Is there still a virtual file system involved at all?

jsmith45 · on Sept 1, 2022

Microsoft is trying to get away from needing a VFS. Scalar eventually accomplished that by combining some enhancements that had been added to git since they originally designed the Git VFS, and plus some new features that only live in scalar.

The long term plan is for Scalar to either go away completely, or possibly just be a simple front end for setting up a repository to use certain optional git features.

They actually have a version of scalar in official git's contrib folder, and they are very actively working with the core git team to convert the relevant features into core git features in whatever way the core git team is happy with.

But of course the present situation is certainly confusing, and does not seem to be well documented. I've only picked up on some of this from reading the git mailing lists, and have no idea how to actually use scalar.

fomine3 · on Sept 1, 2022

As Scalar README says, is it merged to microsoft/git ?

modeless · on Sept 1, 2022

That doesn't really answer my questions though. What exactly does microsoft/git do that makes monorepos better? The readme doesn't answer. They have a branch called vfs but the readme doesn't mention virtual filesystems at all.

caycep · on Aug 31, 2022

was this before or after the github acquisition? I figure if they were going to spend the money for github they probably intend for git to have a bigger role

WorldMaker · on Aug 31, 2022

Microsoft started the Windows transition to git sometime before Github acquisition as a part of a company-wide effort to move to VSTS/Azure DevOps Repos internal dogfooding. The choice there was between the zombie ghost of TFS source control or git, so you can possibly imagine some of the reasons why they chose to put a lot of time/effort into making git work for themselves. The subsequent acquisition of Github seems to have made them even happier at the switch to git.

dark-star · on Aug 31, 2022

long before

astrange · on Aug 31, 2022

Apple has traditionally left absolutely everything up to individual teams.

flatiron · on Aug 31, 2022

Everything I touched at apple while I was there was internal GitHub.

saagarjha · on Sept 1, 2022

A lot of teams use BitBucket. The PIE GitHub sees most of its use inside of certain orgs.

fr2null · on Aug 31, 2022

MS uses git with Azure DevOps, at least for all the repos I'm familiar with.

userbinator · on Sept 1, 2022

It seems they used their own proprietary solution before starting to support git: https://en.wikipedia.org/wiki/Azure_DevOps_Server#Source_con...

saagarjha · on Aug 31, 2022

Lots of Git projects. Internal development has centered around using Git, even for WebKit, for several years.

aendruk · on Aug 31, 2022

That’s a hefty repository if all you want [1] is part of it. Subversion offers a way to download just a subdirectory; is there an analogous solution for Git?

[1]: https://github.com/HimbeersaftLP/ios-safari-remote-debug-kit...

gman83 · on Aug 31, 2022

sparse-checkout -- https://github.blog/2020-01-17-bring-your-monorepo-down-to-s...

cerved · on Sept 2, 2022

no, that's still a full download, just the checkout is sparse

cerved · on Sept 2, 2022

you're looking for partial clone

https://git-scm.com/docs/partial-clone

notriddle · on Aug 31, 2022

GitHub does: https://docs.github.com/en/get-started/importing-your-projec...

a_t48 · on Aug 31, 2022

`git subtree` but it might still download the whole history

cerved · on Sept 2, 2022

subtree is for extracting / pulling a subtree, a different problem

a_t48 · on Sept 3, 2022

Is a sub tree not a folder? I recall using it before for syncing subsets or another repo.

lxe · on Aug 31, 2022

> git’s distributed nature makes it easy for not just multiple developers, but multiple organizations to collaborate on a single project.

Git is as "distributed" as Ethereum at this point. You have a central repo on github onto which you push changes. Just because you have a copy of the main branch when you're working on stuff, it's no different from SVN.

Yes it has the capability to be distributed, and individual contributors can certainly host their own git servers and you can have as many remotes as there are contributors, but we aren't doing things this way are we?

samatman · on Sept 1, 2022

This is the kind of thing you say when you've never worked with a centralized version control system before, because if you had, you wouldn't.

simonw · on Aug 31, 2022

I think the point here is that you ARE doing things that way if you are multiple companies collaborating on something at the scale of WebKit.

ipaddr · on Aug 31, 2022

You are seeing the tip of the iceburg. The most discoverable online platform is github and others exist at much smaller numbers. Because these are public they are easy to see and count. Private instances are hidden by default which makes them hard to count.

tropicalfruit · on Sept 1, 2022

I love svn, it puts some constraints on other non technical people. That makes my life easier.

colordrops · on Sept 1, 2022

Could you be more specific?

tropicalfruit · on Sept 1, 2022

- it's more difficult to do "agile", churning out lots of code, working simultaneously on the same pages, or functions between different teams, which then needs to be merged together by some unfortunate (possibly clueless) dev. it puts a restraint on ignorant management who don't appreciate downstream issues.

- merging and branch maintenance is more rigid with svn, so teams tend to have dedicated release teams to handles this, instead of offloading this onto devs - most devs now are expected to handle git merging and so on, not so with svn.

i worked about 10 years with svn, and 5 with git, and just my exp only of course.

joemaller1 · on Aug 31, 2022

Curious to see how this affects contributions.

brnt · on Aug 31, 2022

Could WebKit be a viable alternative to Blink, should Mozilla bite the dust?

filmgirlcw · on Aug 31, 2022

I mean, Blink is a fork of WebKit, so yes? But it’s also even less-responsive to upstream contributions so, outside of embedded systems that had previously adopted WebKit, I seriously doubt it'll recapture the traction it had before the fork.

Longhanks · on Sept 1, 2022

"recapture the traction"? It is used by literally billions of devices of varying form factors every day. It has probably 100x the number of users as Gecko/Mozilla Firefox and one of the wealthiest companies on the planet sponsoring development, and I don't see Apple betting on Gecko or another fork of Chrome anytime soon.

filmgirlcw · on Sept 1, 2022

Who said anything about Apple? I would never suggest Apple use another fork or the fork they incubated/created/developed. I was replying to a comment asking if WebKit, which != Safari could be a replacement for the Chrome-driven world we are in and when used by people who aren’t Apple.

And yes, it is absolutely bigger than Gecko. No question. But are you going to tell me with a straight face that WebKit has the same amount of traction with third-parties as Blink? Because sorry, that’s not the case. That was the case for about 5 years, when post iPhone, everyone and their brother decided to use WebKit to power their mobile browser for whatever mobile OS they were building for (Android, BlackBerry, Symbian/Maemo/Meego, webOS, Tizen) or for whatever embedded systems they were designing for in-car systems or whatever, but after the Google fork became demonstrably different, the surviving players in that arena switched to Blink because Google was faster at iterating and easier to work with for upstream commits (easier does not mean easy).

kitsunesoba · on Aug 31, 2022

Ideally, we need both Gecko and WebKit to be healthy, with additional promising alternatives on the horizon.

JohnTHaller · on Aug 31, 2022

Blink is a fork of WebKit. Apple manages WebKit development. Google manages Blink development.

no_way · on Aug 31, 2022

It is a fork, but that was really long time ago, most parts of both engines were completely rewritten so for all intents and purposes these are completely different engines.

filmgirlcw · on Aug 31, 2022

They’ve splintered a ton over the years for sure, but there are still similarities. But yes, this isn’t like the first few years when Blink was just WebKit with V8.

But on the whole, Blink is still more similar to WebKit than it is to Gecko.

capableweb · on Aug 31, 2022

Maybe not really long time ago, maybe around ten if I'm remembering correctly. But, maybe in web time, that's pretty long time ago.

sph · on Aug 31, 2022

Not any more viable than it is now.

Brendan Eich IIRC said they looked into building Brave on top of Webkit but it was so hard to compile and embed across all three platforms that they went with Chromium. Same story with Gecko.

So that's another reason why we now have a Blink monoculture: because the alternative engines didn't spend any effort in making them usable by third party applications.

pabs3 · on Sept 1, 2022

Hmm, WebKit is used by lots of apps on Linux these days; at least GNOME Web, GNOME evolution and Liferea.

sph · on Sept 1, 2022

WebKit is used by the WebkitGtk library which all of these include.

But it's been stagnating for years and it's still not a viable replacement for Gecko and Chromium (broken scrolling on scaled screens, WebAuthn is unsupported so I can't login to my email account, etc.)

In any case, if I recall correctly Eich's comment, the biggest difficulty was building webkit for Windows. The documentation was very out of dated, and it's understandable given Apple's focus on its OS. Even this GitHub says it's a web engine for "macOS, iOS and Linux".

Cyberdog · on Aug 31, 2022

What does Mozilla biting the dust have to do with WebKit being viable?

And even if they did, I think Gecko has enough of a fan base that it would live on for a very long time. NetSurf and Dillo are still around after all.

fomine3 · on Sept 1, 2022

If Mozilla bite the dust, unfortunately Firefox/Gecko developers will lost the job, then possibly some of them start developing WebKit or based browser on Apple or different corp. I don't want it.

vbezhenar · on Aug 31, 2022

If that’s the same WebKit powering Safari, why not. It’s the second most popular browser after all.