Hacker News new | past | comments | ask | show | jobs | submit login

What causes people to need to sprinkle license boilerplate everywhere, including in files which are otherwise completely empty (like r2/r2/config/__init__.py), and then to have to update them every year? See this commit: https://github.com/reddit/reddit/commit/90cfcaaecc56cf35e758...

It just seems to defy reason that we must make humans increment a number every year in every file in our projects, nevermind the fact that the top 20-30 lines of every file in our projects has been taken over by stuff most readers don't actually need to read (again and again).

Is this really the best we can do without somehow letting the bad guys take our home away due to some licensing gotcha? Like simply having this at the top of each file:

    # see the top-level LICENSE file



I do this too and so why do I do it?

In essence it's because there is a very large number of lazy programmers who live by cut and paste. We're not just talking the "I've found a solution on Stack Overflow and will use that", but more "I've searched Github for keyword + language and this file does what I need".

The files are copied into their projects in its entirety, sometimes whole libraries are, and those programmers never bother to check how a project is licensed.

Once this process has been repeated a few times the code is firmly detached from the licence and any original license is ignored.

If I use the suggestion you make, then by them copying files into their project they have changed the licence of a file (it now inherits whatever their project uses).

Though I do like the idea of a stub instead of the full thing:

    # Licence: BSD (3-clause) https://github.com/owner/project/LICENCE.md
That would be enough to describe the licence for the file in a way that survives cut and paste, whilst also providing a URL for the full licence details.

In fact, I will now probably shift to that.


> In essence it's because there is a very large number of lazy programmers who live by cut and paste.

I've had programmers copy and paste GPL'd code into proprietary projects I'm responsible for. It's not laziness it's ignorance. "What's a GPL?"


I've shipped hard product with GPL onboard and in use. Its great! I've also complied, 100%. Also great!

Laziness, ignorance, irresponsibility. Or: usage.


> I've also complied, 100%

The cards I'm dealt are "proprietary projects". I comply 100% too: we don't ship GPL'd code. We've gotten close, though (c/o what I mention above).


Programmers who copy and paste can always copy the fragment they need, no need to pay attention to headers. Ultimately licensing relies on people operating in good faith. Even without explicit licensing, there is an implicit copyright on works such as source code, so taking code from random places without checking out the license is never warranted either way.

> If I use the suggestion you make, then by them copying files into their project they have changed the licence of a file (it now inherits whatever their project uses).

No, the only one who can actually change the license of a file is the rights holder, so the person who copied the code while ignoring the license misrepresents matters but does not change anything about how the work is licencsed.


If you think that's bad, you should see the many people who tried to copyright an empty file: http://trillian.mit.edu/~jc/humor/ATT_Copyright_true.html



> What causes people to need to sprinkle license boilerplate everywhere, including in files which are otherwise completely empty

The Apache 2 license library has language that indicates the use is to put bits of the license in every file. That's why. It's easy enough to maintain a license at the top of files with an IDE like IntelliJ


>Is this really the best we can do

Perhaps in places like github. With central versioning systems where the server is under our control we simply run daemons that check the copyright. Each user can define how it should work for them. If copyright is not okay the user can either a) have the submit fail so he is notified that it needs fixing or b) let it be fixed automatically by the daemon.

This fixing also includes adding a copyright notice to new files that didn't have any. Nicely defined depending on the file type.

The implementation was a one time effort which now saves us from doing exactly what they are doing now. Manually going through thousands of files to fix a copyright.


It is an eye sore for me as well.

The CakePHP project has done away with yearly update by replacing the year(s) with '(c)'.

https://github.com/cakephp/cakephp/commit/7b860debe4731a9cbc...

I remember watching a Stephen Fry interview who mentioned that placing the Copyright symbol once on your piece of work is sufficient to claim Copyright. But is placing the symbol once on a book, the same as placing a Copyright/License block once in a project directory?


As a matter of fact, you do not need to even declare copyright anywhere in the text to claim copyright (at least in the US). Copyright exists from the moment of the work's creation. [1] And placing a copyright notice does not afford you any other benefits without registration anyway. Once you've registered with the US Copyright Office, you may place a copyright notice if you want, but your work is still protected even if you don't. [2]

[1] http://copyright.gov/help/faq/faq-general.html#register [2] http://www.copyright.gov/title17/92chap4.html#401


I'm dating a lawyer. According to them, what you've written is true, however, speaking practically, there is a significant advantage to be gained from presenting evidence. If two parties show up to a dispute with identical source code, the one that has a copyright in it has an advantage. Sure, it's easily faked, and that could be argued, however, it would be trying to argue away evidence that exists which is much more difficult than arguing in favor of something that does exist. So if you want to lock in a victory and reduce court time, use copyright notices (and other legal notices like trespassing signs, etc.) liberally.


It just seems to defy reason that we must make humans increment a number

Ah, but why do you assume a human did that? Writing a script to update the year in all files doesn't take more than a few minutes to write. Chances are he simply ran "update_license_year" and committed.

Likewise for having a license on every single file: it may simply be a git hook that preprends it to every file with a certain extension.


The legal department, not the programmer, probably dictates that the copyright headers must be in each and every file.


I know this ruins any potential value of the intellectual property but…

…why not just put a mention in the top-level license that all empty (0 byte long) __init__.py files are in the public domain?

(Yes, yes, it's less confusing to license the entire thing under one license. But attempting to assert copyright on an empty file is humorous.)


empty __init__.py declare a python module. see http://stackoverflow.com/questions/448271/what-is-init-py-fo...


I am well aware of why an empty __init__.py is necessary. It is less clear why one must add license boilerplate to such a file: https://github.com/reddit/reddit/blob/master/r2/r2/config/__...

> you may not use this file except in compliance with the License

This is patently absurd: the file contains no content other than the license itself, and arguably its name (which is shared by millions of other __init__.py files around the world).


Ah. Sorry, I didn't understand this point. I agree it is totally superfluous to declare a license on an empty file.

But I assume that they have this header on every file as part of their internal process. Hence, they don't make an exception for empty files. I would book it as a cost of this process.

Also, it is handy if somebody starts appending to it ( i.e. https://github.com/reddit/reddit/blob/master/r2/r2/lib/autho... ) , they don't need to take care of that the license is correct.


An empty file with no surrounding context is just an empty file, but is an otherwise blank file embedded within proprietary software whose presence is required for the software to function somehow public domain? The contents of the file are trivial, but it's existence may not be.




Consider applying for YC's Spring batch! Applications are open till Feb 11.

Guidelines | FAQ | Lists | API | Security | Legal | Apply to YC | Contact

Search: