OERca2-importer-T

OERca2 Importer Tool

overview
The import tool will allow users to bring content into OERca. Users will need to import materials such as PowerPoint or PDF files, as well as individual images, videos, web pages, and zip files. The import tool should decompose documents when possible, extracting individual content objects as well as text, and the interface should let the user choose whether or not to decompose a document. The import tool should provide feedback about the progress of the upload. It should also allow users to import multiple items at once, including items of different types (e.g., a PowerPoint file as a material and a JPG file as a content object). On entering the import tool, the user should be presented with a single field that accepts any kind of document location (e.g., a path on the hard drive, a URL, a zip file location, or a CTools import). The system should recognize what kind of document is being imported rather than forcing the user to choose among separate fields for content from different locations. Lastly, the user should be able to link the uploaded file or files to documents already in OERca (versioning).
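As a rough illustration of the single-field idea, the sketch below guesses the origin and kind of one entry typed into the import field. It is only a sketch under stated assumptions: the function name `classify_entry`, the `KINDS` extension table, and the returned keys are all hypothetical and not part of OERca, and a real importer would also inspect file contents rather than trusting the extension.

```python
from pathlib import PurePosixPath
from urllib.parse import urlparse

# Hypothetical extension table (an assumption for illustration only).
# Documents are treated as "materials", media files as "content objects".
KINDS = {
    ".ppt": "material", ".pptx": "material", ".pdf": "material",
    ".doc": "material", ".odt": "material", ".html": "material",
    ".jpg": "content object", ".png": "content object",
    ".mp3": "content object", ".mp4": "content object",
    ".zip": "archive",
}


def classify_entry(entry):
    """Guess where an import entry comes from and what kind of item it is."""
    entry = entry.strip()
    parsed = urlparse(entry)
    if parsed.scheme in ("http", "https"):
        origin, path = "url", parsed.path
    else:
        # Anything else is treated as a local path; normalise Windows separators.
        origin, path = "local", entry.replace("\\", "/")
    suffix = PurePosixPath(path).suffix.lower()
    kind = KINDS.get(suffix, "unknown")
    return {
        "origin": origin,
        "kind": kind,
        # Only documents and archives are offered for decomposition here.
        "decomposable": kind in ("material", "archive"),
    }


if __name__ == "__main__":
    for item in ["http://example.org/lectures/week1.ppt", "C:\\Users\\me\\photo.JPG"]:
        print(item, classify_entry(item))
```

Returning a `decomposable` flag rather than decomposing immediately keeps the choice with the user, as the overview requires.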

functionality

 * upload standard types of content (PDF, MS Office, OpenOffice.org, Adobe CS, IMS CC, IMS CP, zip, video, audio, images, HTML, XML) from typical points of entry (hard drive, USB, URL, RSS)
 * select multiple files for import
 * choose which files will be decomposed
 * decompose files by extracting embedded content objects (images, audio, video) and all text, then indexing both the text and the objects
 * create images of each page/slide within a document (any video processing?) and index them
 * extract unique indexed terms to serve as keyword suggestions
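
The keyword-suggestion item above could be sketched roughly as follows. This is a minimal illustration, assuming the text has already been extracted from a document. The function name `suggest_keywords`, the tiny `STOPWORDS` set, and the frequency-based ranking are all assumptions rather than anything specified for OERca.

```python
import re
from collections import Counter

# Hypothetical, deliberately tiny stopword list; a real index would use a fuller one.
STOPWORDS = {"the", "a", "an", "and", "of", "in", "to", "is", "are", "for", "on", "with"}


def suggest_keywords(text, limit=5):
    """Return up to `limit` distinct terms from extracted text, most frequent first."""
    # Lowercase the text and keep alphabetic words of two or more characters.
    words = re.findall(r"[a-z][a-z'-]+", text.lower())
    counts = Counter(w for w in words if w not in STOPWORDS)
    return [term for term, _ in counts.most_common(limit)]


if __name__ == "__main__":
    slide_text = "The cell membrane surrounds the cell. Membrane proteins cross the membrane."
    print(suggest_keywords(slide_text, limit=2))
```

Ranking by raw frequency is only one option. Terms could instead be weighted against the rest of the index so that words common to every document are suggested less often.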