More

datageek · on Feb 26, 2012

the algorithm needed to execute quickly. image analysis takes too long.

kabdib · on Feb 26, 2012

What's the bottleneck? Bandwidth for downloading images? Understanding of an algorithm that would do the job?

petewarden · on Feb 26, 2012

Bandwidth and general cumbersomeness of dealing with larger amounts of data with starving-startup resources. I actually spent about a decade of my career focused on image processing, and while I love it's power, I knew how much of an engineering challenge it can be at massive scale. I need to do a blog post about this, since I know my choice is a bit surprising and needs explanation.

marshallp · on Feb 26, 2012

Difficulty in creating an algorithm.

There are ways to get it done algorithmically, however, the challenge is in getting enough data. 30,000 images is too low. you would need a few million, then just simple machine learning algorithms would work.

The latest machine learning techniques such as unsupervised deep learning might work, with millions of unlabeled images and the 30,000 labelled.

lwat · on Feb 26, 2012

Technically I wouldn't call it a 'Photo quality algorithm' in that case

datageek · on Feb 1, 2012

Kaggle - San Francisco, CA

We're looking for:

* Data scientists

* Developers (REMOTE)

* Technical sales

More information at http://www.kaggle.com/pages/jobs

Kaggle has just closed a large Series A ($11.25m). Our early employees will help shape Kaggle's direction and grow along with the company. Regardless of the position, you should have a strong interest in data science and the intellectual curiosity to engage with competition clients from a wide variety of fields.

Kaggle is aiming to build a meritocratic marketplace that will change the way data science gets done. Read more at: http://www.businessweek.com/magazine/kaggles-contests-crunch...

datageek · on June 28, 2011

FWIW, Martin O'Leary doesn't sound like a Jewish name.

creativeone · on June 28, 2011

ONLY 40% of Physics nobels are given to jews, that leaves 60% to non-Jews. So, its not surprising that Martin O'Leary isn't Jewish. Unless his mom is and he took his dad's non-Jewish name. :)

datageek · on Nov 23, 2010

Doesn't seem to be anything stopping ppl from including other data.

pufuwozu · on Nov 24, 2010

You're welcome to bring additional data as long as it's publicly available.

http://kaggle.com/view-postlist/forum-29-rta-freeway-travel-...

sukuriant · on Nov 24, 2010

"This competition requires participants to predict travel time on Sydney's M4 freeway from past travel time observations". This line seems to suggest that the past travel time is the most important part of the experiment; however, as one other (rrrhys) pointed out, the data is useless, since the road has changed, and the grandparent of this post mentioned sporting events affecting traffic.

All of that said, perhaps a strong model can be generated using just historical data.

datageek · on Sept 12, 2010

Build a better chess rating system and enter your system into the following competition: http://kaggle.com/chess.

You may want to use machine learning techniques, which you can learn using the Andrew Ng's Stanford lectures (http://www.youtube.com/watch?v=UzxYlbK2c7E&feature=chann...).

datageek · on Aug 16, 2010

Bustaname.com has some neat functionality, whereas Bruteforcenaming.com is super simple.

datageek · on Aug 3, 2010

at 50 years old - it's probably due for an upgrade!

jacquesm · on Aug 3, 2010

Instead of being upgraded the ELO system is actually being applied to other sports because of its solid statistical basis.

ulvund · on Aug 6, 2010

It is solid and it is easily computable

datageek · on Aug 3, 2010

Benford's Law is really simple and neat. Worth reading about in isolation http://en.wikipedia.org/wiki/Benford%27s_law

datageek · on Aug 3, 2010

Possibly the start of an arms race between fraudsters and data miners?

datageek · on Aug 2, 2010

I wonder if there's a business model in this?