Web archiving data analyst and crawl engineer

London
The National Archives, UK
Data analyst
Posted: 21h ago
Offer description

Archives are special. As a home of our collective memory, The National Archives (TNA) plays a unique role. We hold records of events of national and international importance, as well as documents that speak to our everyday lives, spanning the last one thousand years. Our web archives are unparalleled in their quality and richness and provide a unique source of evidence of our contemporary government and state.

We’re looking for an enthusiastic and skilled Data Analyst with experience in web crawling, scraping, or analysis, to support our workflows, help us understand more about our collection, and grow our web archiving capability.

Web archives are fascinating. At The National Archives, we deliver three public web archive services. They are vast collections of government websites and social media. The scale of these collections, the variety of users’ needs, and the complexity of the data make them challenging and fertile ground for innovation. This role is fundamental to our mission to improve our collection processes, ensuring the highest quality and fidelity, understanding our collection, and conveying these insights to a range of people.

As The National Archives’ Web Archiving Data Analyst and Crawl Engineer, you will bring your expertise and in-depth knowledge to develop and shape key aspects of our web archiving services, making you a key member of the team as those services evolve.

We work with suppliers who deliver many of our technical services, but we are increasing our in‑house capability and expertise. These workflows are now important parts of our service and you will own and develop them, including by finding ways to improve efficiency and resilience. You will embrace challenge and look for opportunities to do things differently.

Working closely with the Senior Data Engineer and our Web Archivists, you will help deepen our understanding of our web archiving and social media collections and use these insights to help tell the story of government online. This includes engaging with experts within The National Archives as well as with external organisations across the digital preservation community and other government departments, by sharing your knowledge with others and raising the profile of our work.

You will be passionate about data and technology. You will thrive in an environment which values and supports continuous learning and self‑development.

Web archiving is an exciting, specialist, varied and rapidly evolving field that is a lot of fun to be involved in. Building and maintaining excellent web archiving services calls on a range of skills: problem solving, creativity, developing new techniques for capturing and replaying content, as well as supporting research, and managing stakeholders and projects.

You will support others’ research by delivering development that will help users explore our services “as data”. You will also contribute to the team’s tools and processes, ensuring that we can go about our work as efficiently and effectively as possible.

This is a full‑time post. However, TNA are open to considering requests for part‑time working, flexible working and job sharing. A combination of onsite and home working is available and applicants should be able to regularly travel to our Kew site for a minimum of 60% of their work time.


Person Specification

* Substantial experience with web data extraction technologies, including web scraping and crawling, and with handling web-based data formats (HTML, XML, JSON, WARC, CDX)
* Proven ability to build and maintain data pipelines that clean, transform, and aggregate data from multiple sources, ensuring data quality throughout the process
* Experience creating data visualisations, reports, and dashboards that effectively communicate complex findings to diverse audiences, including senior stakeholders
* Understanding of data management principles for large‑scale digital collections, including quality assurance approaches and working with both structured and unstructured data
* Strong ability to translate technical concepts for non‑technical stakeholders and gather requirements from users at all organisational levels
* Evidence of problem‑solving skills with ability to research complex issues, propose innovative solutions, and adapt to changing priorities
* Strong organisational skills with proven ability to manage workflows, meet deadlines, and work effectively within multidisciplinary project teams
* Experience with cloud technologies (e.g. AWS), APIs, databases, and modern data infrastructure approaches
* Knowledge of digital preservation, web archiving software, or research data management principles with understanding of metadata standards and compliance requirements
* Familiarity with software testing, version control, code documentation best practices, and interest in emerging technologies for digital data challenges

Seniority level: Associate

Employment type: Full‑time

Job function: Other

Industries: Museums, Historical Sites, and Zoos; Government Administration

© 2025 Jobijoba - All Rights Reserved
