Hacker News new | past | comments | ask | show | jobs | submit login
Twitter Acquires Gnip (gnip.com)
129 points by noinput on April 15, 2014 | hide | past | favorite | 41 comments



So Twitter basically acquired their own data? Hmm.. more proof I dun know how Silicon Valley works.


I'm a customer of Gnip's - they provider several enrichments and data normalizations on top of Twitter's stream. Could Twitter have hired X engineers and duplicated what Gnip does? Perhaps - but Gnip also has an existing customer base, existing revenue flow, infrastructure setup to handle customers creating their own matching rules etc.

There's more to Gnip than just re-selling the Twitter firehose.


Gnip also integrates with a number of other services as well


Will these sites want Twitter to have firehose access to their data? For example Tumblr, Foursquare, Wordpress? They also offer Facebook data, but it is a managed public API connection.


It's a very good question. I really have no idea what the implications are and probably depend on each company.


I'll try to simplify it as much as possible.

Capital is raised by companies to accelerate growth. Growth acceleration can manifest itself in many ways. Specifically, in this case, as you rightfully point out, Twitter could easily build this service themselves. They also, at the time of Gnip's inception weren't totally sure there was a business model here.* So they let another organization take that risk for them, instead of using their own capital. They end up paying more capital in the long run but the risk is reduced.

*I recognize that most likely the certainty of Gnip's business model is (1) much higher due to the "duh, of course people want this" nature of their business and (2) that the people investing here are all "in this together". It, however, doesn't negate the fact that there is indeed risk.


Or buying their way out of the data contract Gnip has?


Twitter provides their raw data to multiple outside companies. These companies compete with each other to add value to the raw data, while Twitter focuses on solving its core problems. Everyone wins. (Except Twitter's users, who make these companies millions of dollars without a penny in return.)


I would imagine Twitters users are getting some kind of value of out of the service. Otherwise they wouldn't use it.


Interesting: It seems twitter now does see distributing their data as a valid businessmodel (before they 'outsourced' this to DataSift and Gnip). Wonder what it will mean for the other sources Gnip has, and for the customers of Datasift.


I've often wondered if there's any hidden terminology in some of these press release, eg: "http://venturebeat.com/2014/04/15/twitter-buys-gnip-taking-o...

"To that end, we have agreed to acquire Gnip" - I'm wondering for example if the "agreed to acquire Gnip" means that Twitter doesn't see this as a high-value acquisition on the same level as Crashlytics or Mopub.


I found that phrasing weird in the blog as well: "Today I’m pleased to announce that Twitter has agreed to acquire Gnip".

Makes it sound like they almost forced the acquisition on Twitter.

I'm not sure that Gnip is a 'high value acquisition' for Twitter though - AFAIK they're basically just re-streaming Twitter fire hose data which Twitter could do themselves. This is probably just a way for Twitter to lock down/get even more control over their ecosystem.


The wording is generally used because the transaction has been agreed t though it may not have been completed officially / legally.



Maybe that language indicates who has the leverage in the deal and thus reflects in their stock price - post-buy.

It would be interesting analizing the phrases surrounding acquisitions both public and private may reveal this...


I guess that makes Twitter data cheaper than before? Last time when I contacted them for one week of worth tweets (with some filters on top) they wanted me to pay tens of thousands dollars. I ended up crawling the data myself.


were you able to get firehose access for that? or were you just spidering the public tweet stream?


Most likely spidering; its difficult/expensive to get firehose access.


no, i just used streaming api to gather the data. you can crawl surprisingly more data with streaming api as opposed to regular api.


does this mean that they will no longer sell non-twitter data like Foursquare?


I wonder what this will mean for Gnip's biggest competitor, DataSift.



Best of luck to the Gnip guys -- they're a great group of people.


Congrats to those guys. I worked with their service for about a year and they had a great product.


Interesting, I had gotten the impression Twitter was aligning itself more with DataSift in recent years. I used Gnip a few years ago and thought it was a great service, although the high price point drove us to ultimately roll our own.


By "roll our own" do you mean you just used the twitter streaming API? Did that have enough data?


Yes, Twitter streaming API plus several other data sources (Facebook, Reddit, forums, etc.). It's enough for us since we're always filtering by keywords or users. We were also granted elevated access.


What was the process like for elevated access? I'm deciding between using Gnip or Twitter's streaming API today actually.


It was pretty straightforward in our case. We had to show them that we had use cases that weren't feasible with the default access level and also demonstrate that we weren't competing with their user experience at all. Not sure how much it mattered but we had been using their streaming API for almost as long as it existed.


I had the same question when I read this. The streaming API is only a fraction of the data offered by the firehose, but that's not as much of a problem in every use case.


Not surprising to see Twitter finally acquire Gnip after working with them for four years. A great achievement for the team at Gnip, and will be interesting to watch the data model they intend on developing with Gnip as time goes on.


Sorry, off topic, but:

Question: is Twitter's free "garden hose" sampled data stream still available?

I used to use the small garden hose sample of near real time tweets. I just tried running an old script to fetch the garden hose and it didn't work.

On topic: I thought that the Gnip business model was interesting, and I can't help but think that the other data sources that Gnip processes and resells may not like this aquisition.


Yes it is.

The API changed and now requires authentication, so that might be why your script wasn't working.


Are you sure the Garden Hose has free access? I can't find any public access to it in the docs, everything directs me to their "data partners", which charge for this information.


I'm assuming you and the OP are referring to the Streaming API? More info here: https://dev.twitter.com/docs/api/streaming


Mazal Tov, but this isn't the game changer I was hopping for (:


Congrats!


Either Ghostery or AdBlock prevent the actual content of the blog post from loading. Please don't rely on Javascript to load content like that.


Not sure why you're being downvoted but I logged in to same the same thing. This blog is blank to me.


He is being downvoted because it is crazy to expect websites to work around the bugs in various 3rd party plugins. This is an issue for the plugins to fix.


I would argue that relying on Javascript to display what ought to be a simple static site is the problem here, but I also agree that perhaps the plugins ought to recognize what's going on and allow that JS to run.




Guidelines | FAQ | Lists | API | Security | Legal | Apply to YC | Contact

Search: