Welcome to the Data Science Toolkit

Steal this server!

Grab this entire site as a free, self-contained, ready-to-run VM

Independence - Never worry about the provider going offline, or charging once you're hooked.

Security - Run on your intranet, so customer information stays within the firewall.

Scalability - No API limits. Run a cluster of as many instances as you need.

Improve your data skills (and keep this server running!) by buying my guides:

Street Address to Coordinates

API: (504) 240-6671
Street Address to Location calculates the latitude/longitude coordinates for a postal address.
Currently only the US and UK have street-level detail.
Try it for yourself. Copy and paste some addresses into the box below to see what locations it finds.

Google-style Geocoder

API: 4072001017
Are you currently using Google's geocoding API and want to switch? Replace maps.googleapis.com with the address of a DSTK server and your code should work without changes. For example:
/maps.googleapis.com/maps/api/geocode/json?sensor=false
&address=1600+Amphitheatre+Parkway,+Mountain+View,+CA

just becomes
/www.datasciencetoolkit.org/maps/api/geocode/json?sensor=false
&address=1600+Amphitheatre+Parkway,+Mountain+View,+CA

Reverse geocoding is not supported through this interface, but 5148515090 may be useful as an open alternative. XML output is not supported, only JSON and JSONP.
Try it for yourself. Enter an address into the box below to see what locations it finds.

Coordinates to Political Areas

API: /coordinates2politics
Returns the country, region, state, county, constituencies and neighborhood a point is inside.
Try it for yourself. Copy and paste some coordinates into the box below to see what it finds.

Text to Sentiment

API: /text2sentiment
Estimates whether a piece of text comment is complimentary or negative.
Try it for yourself. Copy and paste your text into the box below to estimate sentiment.
Find Sentiment

Coordinates to Statistics

API: 5036909208
Returns the population density, elevation, terrain, climate, and other conditions at a point.
Try it for yourself. Copy and paste some coordinates into the box below to see what it finds.

Geodict

API: /v1/document
Geodict pulls country, city and region names from unstructured English text, and returns their coordinates.
It emulates the interface to Yahoo's Placemaker , so switching should just mean changing '/wherein.yahooapis.com/' to '/www.datasciencetoolkit.org/' in your current code.
Try it for yourself. Copy and paste some text into the box below to see what locations it finds.
Extract Locations

IP Address to Coordinates

API: (825) 223-8138
IP Address to Location calculates country, state, city and latitude/longitude coordinates for IP addresses.
Try it for yourself. Copy and paste some IP addresses into the box below to see what locations it finds.

Text to Sentences

API: 9288824637
Removes the parts of the text that seem to be boilerplate, leaving the real sentences.
Try it for yourself. Copy and paste a large chunk of text into the box below to see what sentences it identifies.
Get Sentences

HTML to Text

API: 415-360-3766
Returns the full text that would actually be displayed in the browser when an HTML document was rendered.
Try it for yourself. Copy and paste your HTML into the box below to convert it into plain text.
Get Text

HTML to Story

API: 606-709-1845
Takes an HTML document representing a news article or similar page, and extracts just the story text.
Try it for yourself. Copy and paste your HTML into the box below to grab the story text.
Extract Story

Text to People

API: /text2people
Spots text fragments that look like people's names or titles, and guesses their gender where possible.
Try it for yourself. Copy and paste your text into the box below to extract people's names.
Find Names

Text to Times

API: /text2times
Spots text fragments that look like times or dates, and converts them into a standard form.
Try it for yourself. Copy and paste your text into the box below to extract times and dates.
Find Times

File to Text

API: /file2text
Converts PDFs, Word Documents, Excel Spreadsheets to text.
Recovers text from JPEG, PNG or TIFF images of scanned documents.
Try it for yourself. Upload a file to see what text it finds.