Hacker News new | past | comments | ask | show | jobs | submit login

Just FYI, you should know about SHARE. It's an effort to create a free, open dataset of research activity across the research lifecycle. You can read more at

http://share-research.org

So, if you want to see a reddit for research, better news feeds, etc., it is the SHARE dataset that can provide that data. SHARE won't build all those things--we want to facilitate others in doing so. You can contribute at

https://github.com/CenterForOpenScience/share

The tooling is all free open source, and we're just finishing up work on v2. You can see an example search page http://osf.io/share, currently using v1. Some more info on the problem and our approach....

What is SHARE doing?

SHARE is harvesting, (legally) scraping, and accepting data to aggregate into a free, open dataset. This is metadata about activity across the research lifecycle: publications and citations, funding information, data, materials, etc. We are using both automatic and manual, crowd-sourced curation interfaces to clean and enhance what is usually highly variable and inconsistent data. This dataset will facilitate metascience (science of science) and innovation in technology that currently can't take place because the data does not exist. To help foster the use of this data, SHARE is creating example interfaces (e.g., search, curation, dashboards) to demonstrate how this data can be used.

Why is SHARING doing it?

The metadata that SHARE is interested in is typically locked behind paywalls, licensing fees, restrictive terms of service and licenses, or a lack of APIs. This is the metadata that powers sites like Google Scholar, Web of Science, and Scopus--literature search and discovery tools that are critical to the research process but that are incredibly closed (and often incredibly expensive to access). This means that innovation is exclusive to major publishers or groups like Google but is otherwise stifled for everyone else. We don't see theses, dissertations, or startups proposing novel algorithms or interfaces for search and discovery because the barrier of entry in acquiring the data is too high.




Hi. This looks really interesting. Unfortunately the results page after a search freezes the stock browser on my LG G3.

I've also read the front page, the about page, and your post several times, and I'm not exactly clear what you provide. I thought I'd do some searches to see the product made sense. A search for a field in interested in, arthritis, yielded zero results. Okay, so... no medical research? A search for "reddit" yielded results, and mentions of "providers". I'm not clear what providers are... is reddit a provider, or the research papers, or the publishers, or the researchers...?

I'll read more later when I'm not on mobile, maybe it will be clearer.

I'm starting a project related to analysing published research, so this is a field I'm very interested in. I hope SHARE can help in some way, and I'll definitely be keeping tabs on your work. Thanks for posting.


Are there any plans to provide an API or any kind of database dump to allow building other services based on the aggregated data?




Join us for AI Startup School this June 16-17 in San Francisco!

Guidelines | FAQ | Lists | API | Security | Legal | Apply to YC | Contact

Search: