|
Storage: IMC’s web based Media Archive collections is stored with more than 5800 production minutes and >70 unique topics (status: 08/2012) and hosted since 2006 at IA (Internet Archive). Storage involves parsing, indexing and physically encoding the data.
|
... accidents and
natural disasters maintaining copies of IMC’s Media Archive collections at multiple sites can help alleviate this risk. Part of the collection is already handled this
way, and IA is proceeding as quickly as possible to do the same with the rest.
|
|
The Web datas are
received and stored in archive format of 100-megabyte ARC files made up of many individual files. Alexa Internet (currently the source of all crawls in IMC’s collections) is
proposing ARC as a standard for archiving Internet objects. Preservation: It is one of the most important issues and ongoing tasks of
|
|
Migration: Over time, storage of IMC’s media files can degrade to a point where the datas become permanently irretrievable. Although DLT tape is rated to last 30 years, the industry rule of thumb is to migrate data every ten (10) years. IA no longer use tapes for storage.
|
|
permanently protecting IMC’s stored media resources from damage or destruction. Accidents: The
main issues are guarding against the consequences of accidents and data degradation and maintaining the accessibility of data as formats become obsolete. Any medium or site
used to store data is potentially vulnerable to...
|
Data formats: As advances are made in software applications, many data formats become obsolete. IA will be collecting software & emulators that will aid future researchers, historians and scholars in their research within IMC’s media archive.
|