Audit of video hosting Rutube
A site with a huge number of pages. Simply scanning such a site with a standard audit program will not work. There will not be enough RAM, even if you add up to the maximum possible amount.
Previously, I had already audited sites of large and enormous sizes. During the audit, Rutube finally tested the technique of breaking the site into parts.
First, I collect information about the sizes of the main sections of the site by preliminary scanning. And according to the table with sizes, I prepare partition packages. It turned out to be only 4. The largest folder is Video.
This was the only section that was not fully scanned. But it provided a high-quality representative sample of site pages. Errors that are typical for the entire section.
The remaining sections have been scanned and processed in full.
The most interesting comments found were:
• Ban on indexing Sitemaps.
• Using links with parameters in navigation elements.
• Internal links without anchors (text or images with alternative text).• 3 important canonized pages that could be landing.• Identified source-generator of empty pages and links to them.• A problem with access to Sitemap maps has been detected.• Signs of not complete content of data in the maps of the site are detected.• A router with protocols in the map addresses of the site Sitemap.• False data in the Sitemap maps.Other comments of less importance have been found.
Previously, I had already audited sites of large and enormous sizes. During the audit, Rutube finally tested the technique of breaking the site into parts.
First, I collect information about the sizes of the main sections of the site by preliminary scanning. And according to the table with sizes, I prepare partition packages. It turned out to be only 4. The largest folder is Video.
This was the only section that was not fully scanned. But it provided a high-quality representative sample of site pages. Errors that are typical for the entire section.
The remaining sections have been scanned and processed in full.
The most interesting comments found were:
• Ban on indexing Sitemaps.
• Using links with parameters in navigation elements.
• Internal links without anchors (text or images with alternative text).• 3 important canonized pages that could be landing.• Identified source-generator of empty pages and links to them.• A problem with access to Sitemap maps has been detected.• Signs of not complete content of data in the maps of the site are detected.• A router with protocols in the map addresses of the site Sitemap.• False data in the Sitemap maps.Other comments of less importance have been found.