gusl: (Default)
NeedleBase, by ITA Software*, seems to be a practical way of building a database from unstructured sources. Besides the web scraping (Information Extraction), they have tools for data cleaning/merging, all while maintaining provenance information (i.e. every datum points to its original source). video tutorial

Dapper's Data Mapper is great for making RSS feeds out of mere URLs, e.g. this one I made for Charles Kemp's publications. Semantify might be useful for webmasters.

Freebase has never impressed me.

Tangentially, has anyone used a Web 2.0 application for socially annotating webpages as you visit them (e.g. leaving PostIt notes for your friends to see), or chatting with other people who are visiting them at the same time (social browsing)? I've never seen a good one.

Any thoughts on Flock?



* - soon to be merged into Google, as I found by reading the comments to this post.

smart text

May. 10th, 2006 06:59 pm
gusl: (Default)
Wouldn't it be great if everything you wrote automatically came with a (dynamic) comprehension test? ...if the author didn't even have to (explicitly) write the test, but it got automatically generated by "AI", with the help of semantic tags, with pieces of text being linked to logical statements?

This way, I would never need to read past a section that I didn't understand.

February 2020

S M T W T F S
      1
2345678
9101112131415
16171819202122
23242526272829

Syndicate

RSS Atom

Most Popular Tags

Style Credit

Expand Cut Tags

No cut tags