If you're asking 'what is website archiving?' You've come to the right place. Website archiving is the process of automatically collecting websites and the assets they contain and preserving them in a lasting digital archive.
To achieve this, a 'web crawler' is used to crawl a website, extracting and saving all of the assets and information (learn more about this process here).
Web archiving is the only way of capturing digital records in a form that’s timestamped and immutable, allowing organisations to replay their websites from a specific point in time.
An IGI survey found that 98% of organisations have online data they need to keep - or want to keep - for more than ten years. The cited reasons include:
This isn't an exhaustive list either, organisations are getting more from their web archives today, recognising multiple uses in how they manage, supervise, and govern their online channels.
Due to limited resource and technology, organisations often adopt external archiving solutions. Through an automated service, firms are then able to capture the required records (you can find more detail here).
However, many archiving vendors struggle to capture the modern web due to its complexity and size, instead offering a 'catch-all' solution for multiple channels that's both ineffective and inefficient.
Accuracy - Due to the complexity of modern websites, capturing a website with accuracy has become a struggle for many businesses and service providers.
Replayability - Archiving the assets that make up a website is one thing, but being able to replay a fully interactive website (exactly as it appeared) isn't easy.
Storage - Due to a lack of innovation, many organisations are still using legacy archiving systems that result in huge storage costs and limited scaleability.
MirrorWeb is able to capture and archive all of your websites no matter the size, no matter the complexity. Following capture, we bring them into a single archiving platform where you can then replay, search and interrogate them at any time.
To put things into context, we've:
Every archive is time-stamped, immutable and held as legally admissible records that you own. All records are ISO/WORM compliant and can be accessed on-demand.
Answer your regulatory requirements across:
Whether you need to meet compliance requirements, eDiscovery demands, or ensure digital preservation, we deliver peace of mind by providing web records that protect your organisation.
Get as much detail as you need - access advanced crawl reports that include a full breakdown of MIME types and meta-data, giving you the ability to extract and dissect web data like never before.
All archives are time-stamped, hashed and stored in the ISO28500 standard WARC file (WORM) format.
The client portal gives you control over your archives and enables you to replay any website content at any time.
Full text search across your entire archive and daily digest/comparison reports on all website content.
Identify specific content in your archive and review changes with our content comparison tool.
You define the frequency. Daily, weekly or monthly crawls for your website and social media channels.
All archived website content can be made available to eDiscovery professionals, litigators and other third parties for investigative purposes.
Stay in total control of your data by choosing where it's archived, ensuring full compliance with ISO standards.
Our cloud-based platform is light touch, requiring no infrastructure costs or extra resource burdens on customers.
All of your archives are available as a downloadable PNG or PDF to support your record-keeping processes.
MirrorWeb Limited
Kenworthys Buildings / 83 Bridge Street
Manchester / M3 2RF / United Kingdom
Registered in England / Registration No. 08072284
251 Little Falls Drive / Willmington
Newcastle / Delaware / 19808 / United States
0800 222 9200
info@mirrorweb.com
Website Archiving / Wayback Machine Alternatives