This repository was archived by the owner on Jan 4, 2023. It is now read-only.

Legacy Website Reports are Missing Historical Data #151

@paulcalvano

When the HTTP Archive dataset was expanded in July 2018, new page ids were assigned for the newer URLs. This broke the historical reports: URLs that were previously tracked lost the continuity of their trend data.

You can see an example of this here: https://legacy.httparchive.org/viewsite.php?pageid=94191763. The legacy report continues to include the latest stats, but now only shows trends starting with July 2018:

image

@rviscomi and I believe that this can be corrected by mapping the old pages table records to the new pageids. I've assigned this to myself and will look into it.
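
To make the mapping idea concrete, here is a rough sketch of how old pageids could be matched to new ones. It assumes the URL is the join key and that each row can be reduced to a `(pageid, url)` pair. The function name, the row shape, and the sample data are all hypothetical and are not taken from the actual legacy schema or code.

```python
from typing import Dict, Iterable, Tuple

# Hypothetical row shape: (pageid, url). The real pages table has more columns.
PageRow = Tuple[int, str]


def map_old_to_new_pageids(
    old_pages: Iterable[PageRow],
    new_pages: Iterable[PageRow],
) -> Dict[int, int]:
    """Map each old pageid to the new pageid that shares the same URL.

    Assumption: an exact URL match is enough to identify the same site.
    Old pages with no matching URL are left out of the mapping.
    """
    new_id_by_url: Dict[str, int] = {}
    for pageid, url in new_pages:
        # If a URL appears more than once, keep the first pageid seen.
        new_id_by_url.setdefault(url, pageid)

    mapping: Dict[int, int] = {}
    for old_id, url in old_pages:
        new_id = new_id_by_url.get(url)
        if new_id is not None:
            mapping[old_id] = new_id
    return mapping


# Made-up sample data, for illustration only.
old_pages = [(101, "https://example.com/"), (102, "https://example.org/")]
new_pages = [(9001, "https://example.com/"), (9002, "https://example.net/")]

mapping = map_old_to_new_pageids(old_pages, new_pages)
print(mapping)
```

With a mapping like this, a report could look up the old pageid for a URL and stitch its pre-July 2018 records onto the newer trend data. Whether exact URL matching is sufficient would need to be checked against the real tables.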
