One of my pet ideas is a new programmer's editor that uses proportionally as much compute as Emacs did in the late '80s: an amount of computing that would show up as a major line item on your department's timesharing bill.
1000 cores cost around $50/hr, so if it boosted my productivity measurably it'd be worth it. Searching all public GitHub code for strings matching the code around my cursor seems like the sort of thing those 1000 cores should be doing.
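As a rough back-of-the-envelope check (the per-core price and developer cost below are assumptions, not quotes), the break-even math is simple:

    # Break-even sketch: assumed figures, not real quotes.
    cores = 1000
    price_per_core_hour = 0.05                    # assumed cloud price, $/core-hour
    cluster_cost = cores * price_per_core_hour    # ~$50/hr, matching the estimate above

    dev_cost_per_hour = 100.0                     # assumed fully loaded developer cost, $/hr
    breakeven_boost = cluster_cost / dev_cost_per_hour
    print(f"Cluster: ${cluster_cost:.0f}/hr; pays for itself above a "
          f"{breakeven_boost:.0%} productivity boost")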
I have a similar idea where the cores are put to work optimizing the produced binaries (profile-guided optimization, genetic algorithms, SAT solvers/theorem proving, etc). You could of course also do static analysis and all forms of testing (generative, property-based, unit, integration, performance, etc.).
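As a toy illustration of the flag-tuning part of that idea: a random search over gcc optimization flags, where each trial could run on its own core. This assumes gcc is installed and a bench.c benchmark exists; the flag list and fitness function are placeholders, not a real tuner, and a genuine genetic algorithm would add crossover and mutation on top of this loop:

    # Toy random search over gcc optimization flags (stand-in for a real
    # genetic algorithm). Assumes gcc is installed and bench.c exists.
    import random, subprocess, time

    FLAGS = ["-O2", "-O3", "-funroll-loops", "-flto", "-march=native",
             "-fomit-frame-pointer"]

    def fitness(flags):
        subprocess.run(["gcc", *flags, "bench.c", "-o", "bench"], check=True)
        start = time.perf_counter()
        subprocess.run(["./bench"], check=True)
        return time.perf_counter() - start        # lower runtime = better

    best = (float("inf"), [])
    for _ in range(50):                           # each trial is embarrassingly parallel
        candidate = random.sample(FLAGS, k=random.randint(1, len(FLAGS)))
        best = min(best, (fitness(candidate), candidate))
    print("best flags:", best[1], "runtime:", round(best[0], 3), "s")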
Permission to "view and fork" is not the same as permission to copy and paste into a proprietary software product (i.e., this may not be legit for use by "enterprise" developers).
Interesting idea. It would definitely push you to use your time at the editor efficiently if it costs you $50/hour! But you don't need 1000 cores to fire off a bunch of search requests: network calls are generally I/O-bound, so even one or two cores can keep lots of requests in flight.
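A minimal sketch of that point using asyncio and aiohttp (the search endpoint here is a made-up placeholder): a single core can hold hundreds of searches in flight because almost all the time is spent waiting on the network, not computing:

    # One core, many in-flight searches: the "work" is waiting on the network.
    # The endpoint below is a placeholder, not a real API.
    import asyncio, aiohttp

    async def search(session, snippet):
        async with session.get("https://example.com/code-search",
                               params={"q": snippet}) as resp:
            return await resp.json()

    async def main(snippets):
        async with aiohttp.ClientSession() as session:
            return await asyncio.gather(*(search(session, s) for s in snippets))

    results = asyncio.run(main(["fn parse_args", "def tokenize", "impl Iterator"]))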
Some napkin math: as of late 2013, GitHub's main Elasticsearch cluster had ~128 shards of ~120 GB each [1]. Obviously they have more data now, and it's not clear whether that shard count includes replicas, but at current EC2 on-demand prices [2]:
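For illustration, the arithmetic works out roughly as below. The instance size and hourly price are assumed ballpark figures for a memory-optimized instance, not current EC2 quotes:

    # Napkin math: holding the whole index in RAM.
    # Instance specs/prices are assumed ballpark figures, not quotes.
    shards = 128
    shard_gb = 120
    total_gb = shards * shard_gb                       # ~15,360 GB of index

    ram_per_instance_gb = 244                          # assumed large memory-optimized instance
    price_per_instance_hr = 2.66                       # assumed on-demand $/hr
    instances = -(-total_gb // ram_per_instance_gb)    # ceiling division -> 63
    print(f"{instances} instances, ~${instances * price_per_instance_hr:.0f}/hr")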
A hybrid multi-layer cache system would likely be more cost-effective and still perform nearly as well as dedicated RAM (depending on how the data is structured/sharded/queried).
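A toy sketch of that layering: a small hot tier in RAM backed by a larger, cheaper tier (here just a dict standing in for an SSD-backed store). Names and sizes are illustrative; a real system would add TTLs, proper eviction policies, and sharding:

    # Toy two-tier cache: small hot tier in RAM, cold tier standing in for SSD/disk.
    from collections import OrderedDict

    class TieredCache:
        def __init__(self, hot_capacity=1024):
            self.hot = OrderedDict()          # LRU-ish in-memory tier
            self.cold = {}                    # stand-in for an SSD-backed store
            self.hot_capacity = hot_capacity

        def get(self, key):
            if key in self.hot:
                self.hot.move_to_end(key)             # refresh recency
                return self.hot[key]
            if key in self.cold:
                self.put(key, self.cold[key])         # promote to hot tier
                return self.cold[key]
            return None                               # miss: fall through to the index

        def put(self, key, value):
            self.hot[key] = value
            self.hot.move_to_end(key)
            if len(self.hot) > self.hot_capacity:
                k, v = self.hot.popitem(last=False)   # evict LRU entry...
                self.cold[k] = v                      # ...by demoting it to the cold tier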