Crawling

This space is reserved for the raw crawled data, which may be copied from HDFS or produced locally. The data is divided into two sections:

  • Live - Contains data from the ongoing crawl; it is not shipped to the imagecat server
  • Archive - Contains data that has been shipped to the imagecat server and is now archived
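
The split between the two sections can be sketched as a small helper that moves a finished crawl out of the live area. This is only an illustration under assumptions: the directory names `live` and `archive`, the functions `init_layout` and `archive_crawl`, and the crawl name `crawl-01` are hypothetical and do not come from this repository, and the shipping step to the imagecat server is not modeled at all.

```python
import shutil
import tempfile
from pathlib import Path
from typing import Tuple


def init_layout(root: Path) -> Tuple[Path, Path]:
    """Create the two sections under root (directory names are assumed)."""
    live = root / "live"        # hypothetical name for the Live section
    archive = root / "archive"  # hypothetical name for the Archive section
    live.mkdir(parents=True, exist_ok=True)
    archive.mkdir(parents=True, exist_ok=True)
    return live, archive


def archive_crawl(root: Path, name: str) -> Path:
    """Move one crawl directory from live to archive.

    Assumes the caller has already shipped the data; this function
    only relocates it on the local filesystem.
    """
    live, archive = init_layout(root)
    src = live / name
    dest = archive / name
    if not src.exists():
        raise FileNotFoundError(f"no live crawl named {name!r}")
    if dest.exists():
        raise FileExistsError(f"crawl {name!r} is already archived")
    shutil.move(str(src), str(dest))
    return dest


if __name__ == "__main__":
    # Demonstrate on a throwaway directory so no real data is touched.
    with tempfile.TemporaryDirectory() as tmp:
        root = Path(tmp)
        live, _ = init_layout(root)
        (live / "crawl-01").mkdir()
        print(archive_crawl(root, "crawl-01"))
```

Refusing to overwrite an existing archive entry is a deliberate choice in this sketch, so that archived data is never silently replaced by a later crawl with the same name.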