Posts

Showing posts from August, 2013

2013-08-24: Two WS-DL Classes Offered for Fall 2013

Two WS-DL classes are offered for Fall 2013: CS 725/825 - Information Visualization (Dr. Weigle) and CS 495/595 - Introduction to Web Science (Dr. Nelson). Information Visualization has been taught twice before, but under a 795/895 course number. This semester is the first time Web Science has been taught at ODU, although the course is based on Dr. McCown's Spring 2013 class at Harding University. --Michael

2013-08-23: Archive-It Supports Memento

Earlier this week, Archive-It (the subscription-based collection development service from the Internet Archive) implemented Memento support for their collections, including the newly established "all" collection. This is a follow-on from the recent Internet Archive upgrade of their Memento support in the Wayback Machine. Prior to Archive-It's support of Memento, their collections were included in the Memento aggregator by proxy. While dozens of archives are included in the aggregator via proxies, native Memento support is faster and more functional. Here is an HTTP snippet using an archived PDF of a NASA report from an earlier post about NTRS.

% curl -I -L http://wayback.archive-it.org/all/http://ntrs.nasa.gov/archive/nasa/casi.ntrs.nasa.gov/19960028190_1996060846.pdf
HTTP/1.1 302 Moved Temporarily
Server: Apache-Coyote/1.1
Vary: accept-datetime
Link: <http://ntrs.nasa.gov/archive/nasa/casi.ntrs.nasa.gov/19960028190_1996060846.pdf>; rel="origin
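
As a rough illustration of what a client does with responses like the one excerpted above, the following Python sketch builds an `Accept-Datetime` request header and groups the URIs in a `Link` response header by their `rel` values. The helper names (`accept_datetime_header`, `parse_link_header`) and the `example.org` URIs are hypothetical and chosen only for illustration; the relation names `original`, `timemap`, and `memento` come from the Memento protocol. The parsing is deliberately simplified and makes no network requests, so it should be read as a sketch rather than a complete Link header parser.

```python
import re
from datetime import datetime, timezone
from email.utils import format_datetime

# Hypothetical helpers for illustration; not part of any Memento library.
_ENTRY = re.compile(r'<([^>]+)>([^<]*)')   # one "<uri>; params" entry
_REL = re.compile(r'rel="([^"]*)"')        # the rel="..." parameter


def accept_datetime_header(when: datetime) -> dict:
    """Return the Accept-Datetime header for the desired archival time.

    The value uses the HTTP date format in GMT. `when` must be
    timezone-aware so it can be converted to UTC.
    """
    utc = when.astimezone(timezone.utc)
    return {"Accept-Datetime": format_datetime(utc, usegmt=True)}


def parse_link_header(value: str) -> dict:
    """Group the URIs in a Link header by relation type.

    A single entry may carry several space-separated relations
    (for example rel="first memento"); the URI is listed under each.
    Simplification: assumes URIs contain no '<' or '>' characters.
    """
    links = {}
    for uri, params in _ENTRY.findall(value):
        match = _REL.search(params)
        if not match:
            continue
        for rel in match.group(1).split():
            links.setdefault(rel, []).append(uri)
    return links


# Assumed sample header with made-up URIs, shaped like a Memento response.
sample = (
    '<http://example.org/page>; rel="original", '
    '<http://archive.example.org/timemap/http://example.org/page>; '
    'rel="timemap"; type="application/link-format", '
    '<http://archive.example.org/20130101000000/http://example.org/page>; '
    'rel="first memento"; datetime="Tue, 01 Jan 2013 00:00:00 GMT"'
)

headers = accept_datetime_header(datetime(2013, 8, 23, tzinfo=timezone.utc))
links = parse_link_header(sample)
print(headers)
print(links["original"])
```

The dictionary returned by `accept_datetime_header` could be passed as request headers to an HTTP client of your choice; the commas inside the `datetime` parameter are why the sketch splits entries on angle brackets rather than on commas.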

2013-07-26: Web Archiving and Digital Libraries workshop - WADL 2013 Trip Report

On July 25 and 26, 2013, the WS-DL group attended the Web Archiving and Digital Libraries Workshop, which was collocated with JCDL 2013 in Indianapolis, IN. Ed Fox, from Virginia Tech, opened the workshop by greeting the attendees. Then Andreas Paepcke gave two presentations. The first was entitled "ArcSpread: Enabling Web Archive Analysis for non-CS experts". In it, Andreas showed how to make web archives useful to people outside computer science. ArcSpread uses a spreadsheet interface to help users extract information from a web archive. It starts with analysis activities such as filtering, aggregating, classifying, and manual coding. The output is a spreadsheet that can answer questions about a specific query (e.g., Hurricane Katrina), such as pages containing given words, images associated with the term, names of places and people, and the most frequent names. ArcSpread depends on a sheet engine backed by a Hadoop cluster of 60 nodes. The second pres