Clusteric Auditor | SEO Audit https://clusteric.com Backlinks, Website & Competition SEO Audit in one tool. Thu, 01 Sep 2022 17:19:49 +0000 en-US hourly 1 https://clusteric.com/wp-content/uploads/2016/04/cropped-logo_kulka-—-kopia-32x32.png Clusteric Auditor | SEO Audit https://clusteric.com 32 32 How to easily search for duplicate content on a website? https://clusteric.com/articles-tips-seo/how-to-easily-search-for-duplicate-content-on-a-website/ https://clusteric.com/articles-tips-seo/how-to-easily-search-for-duplicate-content-on-a-website/#respond Sat, 23 Nov 2019 17:17:22 +0000 https://clusteric.com/?p=7073 What is duplicate content? Duplicate content (DC) is nothing more, than the same content duplicated on different pages of the website....

The post How to easily search for duplicate content on a website? first appeared on Clusteric Auditor | SEO Audit.
What is duplicate content?

Duplicate content (DC) is simply the same content repeated on different pages of a website.
DC may be the result of deliberate action (an unconsolidated content strategy), but it is often purely technical (e.g. the site engine serving the same resource under multiple URLs, which then get indexed) or the result of human error.
Clusteric Auditor can help you identify these situations automatically by finding pages that are almost identical, for example very similar articles.

How do you find duplicate content on the site?

We start with on-site analysis mode.

Duplicate content analysis

In our example, we will analyze the DIY guides on one of the popular construction-related sites.
First, we provide the address where the crawl should start.

We choose to analyze internal links only (no other resources, since we are only interested in text).
We must also enable word density analysis of the page content, because it is the basis for the later duplicate content scan.

On the next screen we set one more configuration option: in our example we want to analyze only the section of the website devoted to guides.

We run the analysis. When it finishes, we go to the “Generate” and “Similarity scan” tabs.

We’re interested in Google’s point of view, so we focus on indexable resources without canonical URLs. In addition, you’ll usually want to exclude from the analysis any page text that is repeated on many other pages.
This is usually menu elements, fixed parts of headers or footers, and notices about the privacy policy, cookies, GDPR, etc.
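To illustrate why excluding repeated text matters, here is a minimal Python sketch of one way to drop blocks that occur on most pages before comparing them. How Clusteric does this internally is not described here, so the function name, the block-level granularity and the `max_share` threshold are all assumptions made for the example.

```python
from collections import Counter


def strip_repeated_blocks(pages, max_share=0.5):
    """Remove text blocks that occur on more than max_share of the pages.

    pages maps a URL to a list of text blocks (e.g. paragraphs).
    max_share is a hypothetical threshold, not a Clusteric setting.
    """
    # Count on how many pages each block occurs (at most once per page).
    seen_on = Counter()
    for blocks in pages.values():
        seen_on.update(set(blocks))

    limit = max_share * len(pages)
    return {
        url: [block for block in blocks if seen_on[block] <= limit]
        for url, blocks in pages.items()
    }


pages = {
    "/guides/a": ["Menu | Home | Guides", "How to lay tiles.", "We use cookies."],
    "/guides/b": ["Menu | Home | Guides", "How to paint a wall.", "We use cookies."],
    "/guides/c": ["Menu | Home | Guides", "How to fix a tap.", "We use cookies."],
}
cleaned = strip_repeated_blocks(pages)
print(cleaned["/guides/a"])
```

Without this step the shared menu and cookie notice would make every pair of pages look more alike than their actual articles are.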

We can also set the minimum degree of similarity at which texts appear in the list (on a scale of up to 100 points).
Strong duplicates will usually score over 70 points, but by setting lower values we can also find articles that are simply related thematically and use the report for a slightly different purpose (e.g. adding related or internal links, changing the content strategy or rewriting the articles).
We run the analysis and after a while (depending on the size of the project) we receive the list of results.

If we look at the results in our example, we will see that the list contains two strong duplicates (90 and 96 points), but also texts that overlap to a very high degree and differ only by minor changes (73 points).
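For readers who want a feel for how a word-density-based score like this can be computed, the sketch below uses cosine similarity between word-count vectors, scaled to 100 points. This is a common technique chosen for illustration, not Clusteric's documented algorithm, so real scores from the tool will differ; the function names and the sample pages are made up.

```python
import math
import re
from collections import Counter
from itertools import combinations


def word_counts(text):
    """Count how often each lowercase word occurs in the text."""
    return Counter(re.findall(r"\w+", text.lower()))


def similarity_points(text_a, text_b):
    """Cosine similarity of two word-count vectors, scaled to 0-100."""
    a, b = word_counts(text_a), word_counts(text_b)
    dot = sum(a[w] * b[w] for w in a.keys() & b.keys())
    norm = math.sqrt(sum(n * n for n in a.values())) * math.sqrt(
        sum(n * n for n in b.values())
    )
    return 0.0 if norm == 0 else 100.0 * dot / norm


def similar_pairs(pages, min_points=70):
    """Return (points, url_a, url_b) for every pair at or above min_points."""
    pairs = []
    for (url_a, text_a), (url_b, text_b) in combinations(pages.items(), 2):
        points = similarity_points(text_a, text_b)
        if points >= min_points:
            pairs.append((round(points), url_a, url_b))
    return sorted(pairs, reverse=True)


pages = {
    "/guides/tiles": "how to lay floor tiles in the bathroom step by step",
    "/guides/tiles-2": "how to lay floor tiles in the kitchen step by step",
    "/guides/paint": "choosing the right paint for a wooden fence",
}
for points, url_a, url_b in similar_pairs(pages):
    print(points, url_a, url_b)
```

Lowering `min_points` in this sketch has the same effect as lowering the threshold in the report: loosely related texts start to appear next to the true duplicates.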

The resulting report can also be exported to XLS.

We strongly advise checking duplicate and similar content on your website to avoid technical problems and cannibalisation, and to improve your content strategy.
The free version provides a full on-site report for 1000 URLs, and the similarity scan is limited to the same 1000 URLs. In practice this is rarely a constraint, because you can use crawl filters to split a large site into several analyses.

Catalina compatibility and additional data harvestion options https://clusteric.com/new-functions-and-changes-seo/catalina-compatibility-and-additional-data-harvestion-options/ https://clusteric.com/new-functions-and-changes-seo/catalina-compatibility-and-additional-data-harvestion-options/#respond Thu, 14 Nov 2019 13:22:15 +0000 https://clusteric.com/?p=7068 In connection with the release of the new version of MacOS Catalina and changes related to the authorization of the application,...

The post Catalina compatibility and additional data harvestion options first appeared on Clusteric Auditor | SEO Audit.
Following the release of macOS Catalina and the related changes to application authorization, we have introduced all the updates Apple requires to notarize Clusteric on macOS Catalina.

The contact data collection module has been extended with additional fields for telephone numbers and NIP (tax identification) numbers.

For the harvester, we’ve added the option to limit the number of URLs collected to fewer than the first 100 results.

changelog:
1.79, 30/10/2019
Application notarization on macOS.
Retaining anchor classification when adding new links to the project.
Extension of contact data collection by telephone numbers and NIP numbers.
An additional filter in the URL clipping tool.
Harvester: the ability to limit the number of URLs collected to fewer than the first 100 results.

1.79.5, 11/11/2019
Adapting the program to changes in the structure of Google results.

Export 1 billion data rows from Google Search Console https://clusteric.com/new-functions-and-changes-seo/export-1-billion-data-rows-from-google-search-console/ https://clusteric.com/new-functions-and-changes-seo/export-1-billion-data-rows-from-google-search-console/#respond Wed, 17 Jul 2019 17:42:29 +0000 https://clusteric.com/?p=7081

Downloading large data packages from Google Search Console is no longer a problem. With Clusteric, you can export up to 1 million rows from GSC at a time and compile them as you like.

This option is also useful for archiving keyword, impression, position or click data for a large number of websites.

Because it connects through the API, Clusteric can export all data at once for every domain available in your Google Search Console account.

Data can also be exported according to filters, on several levels.

How to export an unlimited number of rows from Google Search Console?

1. Open the “Search Console Importer” mode in Clusteric.
2. Authorize Google Search Console (by pasting the temporary code)
3. Select the site for exporting GSC data
4. Specify what data you want to download
5. Determine what time range the data should cover
6. If you want to download only a part, set row limits.

With a large amount of data, when you want to examine only a specific subpage or keyword, you can refine your query with a set of filters.

In practice, splitting exports with filters removes any real limit on the volume of data you can export.
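For those curious what such an export looks like at the API level, here is a hedged Python sketch of paging through Search Analytics results. The request fields (`startDate`, `endDate`, `dimensions`, `rowLimit`, `startRow`) are those of the Search Console API's Search Analytics query; the 25,000-row page size reflects the API's documented per-request maximum at the time of writing. The `export_all_rows` helper and the injected `query_page` callable are assumptions made so the example runs without credentials, and they say nothing about how Clusteric is implemented.

```python
def export_all_rows(query_page, start_date, end_date, dimensions,
                    max_rows=1_000_000, page_size=25_000):
    """Collect rows page by page until the data or max_rows runs out.

    query_page is any callable that takes a request body and returns the
    API response as a dict. With google-api-python-client it could wrap
    service.searchanalytics().query(siteUrl=..., body=body).execute().
    """
    rows = []
    while len(rows) < max_rows:
        body = {
            "startDate": start_date,
            "endDate": end_date,
            "dimensions": dimensions,
            "rowLimit": min(page_size, max_rows - len(rows)),
            "startRow": len(rows),
        }
        page = query_page(body).get("rows", [])
        rows.extend(page)
        if len(page) < body["rowLimit"]:
            break  # a short page means there is no more data
    return rows


# Stand-in for the real API so the sketch is runnable offline.
FAKE_DATA = [{"keys": [f"keyword {i}"], "clicks": i} for i in range(60_000)]


def fake_query_page(body):
    start = body["startRow"]
    return {"rows": FAKE_DATA[start:start + body["rowLimit"]]}


rows = export_all_rows(fake_query_page, "2019-06-01", "2019-06-30", ["query"])
print(len(rows))
```

Adding a `dimensionFilterGroups` entry to the request body would narrow the query to a single page or keyword, which is the API-level equivalent of the filters described above.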

Changelog: 1.78.5, 2019-07-16
Possibility to download up to 1M records in the Search Console connector.
CSV export added in the SC connector.

Export 1 million rows of data from Google Search Console

What is Google Search Console?

Google Search Console is a free service offered by Google that helps you monitor, maintain, and troubleshoot your site’s presence in Google Search results. You can use Search Console to submit and test your sitemaps, see how Googlebot is crawling your site, check for security issues, and more. Search Analytics in Search Console lets you see how often your site appears in Google search results, and which queries are driving traffic (and clicks) to your site.

How do you typically export Google Search Console data?

The first step is to log into your Google Search Console account. Once you’re logged in, click on the “Search Analytics” tab on the left sidebar. This will take you to the Search Analytics page, where you can see all of your website’s search data.

At the top of the Search Analytics page, there is a bar with several options. Click on the “Export” option.

This will take you to the Export Data page, where you can select which data you would like to export from Google Search Console. You can export data from the dashboard, search analytics, or analytics data sources.

Once you’ve selected which data you would like to export, click on the “Export” button at the bottom of the page. This will start the export process and your data will be downloaded as a .csv file.

This kind of export is very limited: it exports only 1000 rows at a time.

Export more than 1000 rows from Google Search Console

Exporting more than 1000 rows requires going beyond the web interface, typically through the API.

You can use Google BigQuery, external API services or tools like Clusteric.

With Clusteric, GSC Export to CSV is easy and efficient.

Better estimation of keyword difficulty and indexation https://clusteric.com/new-functions-and-changes-seo/better-estimation-of-keyword-difficulty-and-indexation/ https://clusteric.com/new-functions-and-changes-seo/better-estimation-of-keyword-difficulty-and-indexation/#respond Wed, 12 Jun 2019 12:47:52 +0000 https://clusteric.com/?p=7061 In recent updates, proposed changes raised up and optimize the quality of keyword analysis. Additional summary and data caching include much...

The post Better estimation of keyword difficulty and indexation first appeared on Clusteric Auditor | SEO Audit.
The recent updates improve the quality of keyword analysis. Additional summaries and data caching make the analysis much faster. In addition, export/import options have been added for custom formulas.

We have also worked on improving the speed of other analyses, especially when proxy performance is reduced.

1.77, 27/05/2019
Keyword difficulty: adjustments to match the new presentation of data in search results.
Export / import settings and custom evaluation formulas to / from file.

1.77.5, 10/06/2019
Important update for the new version of macOS.
Analyze Google indexation.
Keyword difficulty mode – performance update.

1.78, 10/06/2019
Bug fix: some analyses were slowed down.

New section in reports (Docx/PDF): Anchor classification https://clusteric.com/new-functions-and-changes-seo/new-section-in-reports-docxpdf-anchor-classification/ https://clusteric.com/new-functions-and-changes-seo/new-section-in-reports-docxpdf-anchor-classification/#respond Tue, 07 May 2019 10:32:21 +0000 https://clusteric.com/?p=7055 You will happily accept the fact that we have additional reports in Clusteric. The anchor classification function was added some time...

The post New section in reports (Docx/PDF): Anchor classification first appeared on Clusteric Auditor | SEO Audit.
You will be pleased to hear that Clusteric has additional reports. The anchor classification function was added some time ago, and it is now represented by a dedicated section of the Docx/PDF report.

The report contains information on the statistical distribution of keywords linking to our domain and the share of these anchors in individual groups (money, brand, etc.). In anchor classification, you can define your own groups to analyze, for example, the popularity of a given section or author. This type of distribution can also be used to define linking and content marketing strategies.
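As a rough illustration of what such a distribution means, the Python sketch below assigns anchors to groups by simple phrase matching and reports each group's share. The group definitions, the extra "url" and "other" buckets and the function names are hypothetical; the classification rules Clusteric applies are configured in the tool and are not reproduced here.

```python
from collections import Counter

# Hypothetical groups; in practice you would define your own.
GROUPS = {
    "brand": ["clusteric"],
    "money": ["seo audit", "backlink checker"],
}


def classify_anchor(anchor, groups=GROUPS):
    """Return the first group whose phrase occurs in the anchor text."""
    text = anchor.lower()
    if text.startswith(("http://", "https://", "www.")):
        return "url"
    for group, phrases in groups.items():
        if any(phrase in text for phrase in phrases):
            return group
    return "other"


def group_shares(anchors, groups=GROUPS):
    """Percentage share of each group among the given anchors."""
    counts = Counter(classify_anchor(a, groups) for a in anchors)
    total = sum(counts.values())
    return {group: round(100 * n / total, 1) for group, n in counts.items()}


anchors = ["Clusteric", "best SEO audit tool", "https://clusteric.com", "click here"]
print(group_shares(anchors))
```

Swapping the phrase lists for section names or author names gives the kind of custom grouping mentioned above.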

Changelog: 1.76:
New reports: anchor classification.
Important update for Win 10 users (v. 1903).

Content similarity recognition (duplicated content) https://clusteric.com/new-functions-and-changes-seo/content-similarity-recognition-duplicated-content/ https://clusteric.com/new-functions-and-changes-seo/content-similarity-recognition-duplicated-content/#respond Thu, 21 Mar 2019 09:37:02 +0000 https://clusteric.com/?p=7040 By expanding on-site options with content management tools, we’ve added the ability to compare the similarity of content on the site...

The post Content similarity recognition (duplicated content) first appeared on Clusteric Auditor | SEO Audit.
By expanding on-site options with content management tools, we’ve added the ability to compare the similarity of content on the site (or URL list).
This option makes it easy to pick out duplicate content within the analyzed subpages and determine their degree of similarity.

We described exactly how to use this feature in the article: Recognition of similarities and duplicate content in Clusteric

Changelog: 1.75:
On-site audit: DC scan / similarities (“Generate …” tab).
On-site audit: support for gzip compressed pages.
Additional Moz API settings.

Keyword density analysis https://clusteric.com/new-functions-and-changes-seo/keyword-density-analysis/ https://clusteric.com/new-functions-and-changes-seo/keyword-density-analysis/#respond Fri, 08 Feb 2019 18:09:34 +0000 https://clusteric.com/?p=7033 In the on-site audit, we’ve added a very useful function of analyzing the density of keywords in the content. With this...

The post Keyword density analysis first appeared on Clusteric Auditor | SEO Audit.
In the on-site audit, we’ve added a very useful function for analyzing the density of keywords in the content. With this option, we can check at the level of a given subpage how well our text is optimized and whether we are overusing keywords (over-optimization). This option is also useful for investigating why a given subpage is evaluated differently from the competition (e.g. by comparing density for a given set of words).
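As a quick reference for what "density" means here, this minimal Python sketch computes each word's share of all words on a page. It is a plain word-frequency calculation written for illustration; the tokenization rule and the `word_density` name are assumptions, and Clusteric's own report may count words differently (for example by handling phrases or stop words).

```python
import re
from collections import Counter


def word_density(text, top=5):
    """Return the top words with their share of all words, in percent."""
    words = re.findall(r"\w+", text.lower())
    total = len(words)
    if total == 0:
        return []
    return [
        (word, round(100 * count / total, 1))
        for word, count in Counter(words).most_common(top)
    ]


print(word_density("Tiles, tiles and more tiles: how to lay floor tiles"))
```

Running the same function on a competitor's page for the same set of words gives the kind of side-by-side comparison described above.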

We also detect page associations at the IP level, not just at the domain level.

Changelog 1.74:
New formula: detection of pages on IP addresses, not domains.
On-site audit: word density (URL details).
Minor fixes.

Sitemap generator https://clusteric.com/new-functions-and-changes-seo/sitemap-generator/ https://clusteric.com/new-functions-and-changes-seo/sitemap-generator/#respond Tue, 18 Dec 2018 08:40:03 +0000 https://clusteric.com/?p=7020 We’ve added a sitemap generator based on on-site crawl (domain level or URL list). In Tools, in the XLS file merging...

The post Sitemap generator first appeared on Clusteric Auditor | SEO Audit.
We’ve added a sitemap generator based on on-site crawl (domain level or URL list).
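For context, a sitemap built from a crawled URL list has a very simple shape. The Python sketch below writes a minimal file in the sitemaps.org 0.9 format using only the standard library; the `build_sitemap` name and the example URLs are made up, and optional fields such as `lastmod` that a real generator may add are left out.

```python
import xml.etree.ElementTree as ET

SITEMAP_NS = "http://www.sitemaps.org/schemas/sitemap/0.9"


def build_sitemap(urls):
    """Return sitemap XML with one <url><loc> entry per unique URL."""
    urlset = ET.Element("urlset", xmlns=SITEMAP_NS)
    for url in sorted(set(urls)):
        entry = ET.SubElement(urlset, "url")
        ET.SubElement(entry, "loc").text = url  # special characters are escaped
    body = ET.tostring(urlset, encoding="unicode")
    return '<?xml version="1.0" encoding="UTF-8"?>\n' + body


crawled = [
    "https://example.com/guides/tiles",
    "https://example.com/guides/paint",
    "https://example.com/guides/tiles",  # duplicate from the crawl
]
sitemap_xml = build_sitemap(crawled)
print(sitemap_xml)
```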
In Tools, we have changed the XLS file merging module: from now on, if individual files contain duplicated column names, we keep the duplicates and add numbers to them (sessions, sessions => sessions, sessions_1).
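The renaming rule is easy to express in code. This short Python sketch reproduces the behaviour from the example (sessions, sessions => sessions, sessions_1); the `merge_headers` function is a hypothetical illustration, not the tool's code.

```python
def merge_headers(*header_lists):
    """Combine column names, numbering repeats: name, name_1, name_2, ..."""
    seen = {}
    merged = []
    for headers in header_lists:
        for name in headers:
            count = seen.get(name, 0)
            merged.append(name if count == 0 else f"{name}_{count}")
            seen[name] = count + 1
    return merged


print(merge_headers(["sessions"], ["sessions"]))
```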

changelog:
On-site audit – sitemap generator.
Small tools – combining XLS files – preserving duplicate columns.
Improving Chinese, Korean and Japanese language recognition.
Minor fixes.

Advanced PDF reports for backlink analysis modes. https://clusteric.com/articles-tips-seo/advanced-pdf-reports-for-backlink-analysis-modes/ https://clusteric.com/articles-tips-seo/advanced-pdf-reports-for-backlink-analysis-modes/#comments Wed, 21 Nov 2018 07:26:20 +0000 https://clusteric.com/?p=6853 Advanced PDF reports for backlink analysis modes. When analysing incoming links, SEO reports are very helpful for understanding their quantity and...

The post Advanced PDF reports for backlink analysis modes. first appeared on Clusteric Auditor | SEO Audit.
Advanced PDF reports for backlink analysis modes.

When analysing incoming links, SEO reports are very helpful for understanding their quantity and quality.

Clusteric can now create SEO reports automatically. They show basic data on the number of links and their parameters, the keywords, and an evaluation of the link profile.

A typical SEO report consists of the following sections:

Table of contents

1. Audit/domain overview
2. Link profile overview
2.1. Links’ health
2.2. Link status
3. How the website is linked
3.1. SITE-WIDE links
3.2. Rel attribute
3.3. Anchors distribution
4. Where the links come from
4.1. Location
4.2. Linking websites
5. Authority
5.1. Indexation
5.2. Social metrics
5.3. Link strength metrics
5.4. Traffic ranks
5.5. Risk factors
6. Audit summary
6.1. General advice

Once the links have been analysed, creating the report is completely automated.

In the agency version, it is additionally possible to export reports to editable files (.doc) for further processing.

Reports in the agency version do not carry Clusteric branding and are also a great opportunity to acquire new clients. A professional-looking report on the initial state of the audited website takes little time to create and makes conversations with prospects easier.

How to create an automated SEO report for backlink evaluation?

1. Download the backlinks you want to analyse. If you have other backlink sources, add them as well.

2. Identify the keywords being researched (up to 1000 keywords).

3. Specify how large a sample of links should be tested.
Site-wide links have similar parameters, so you do not always need to test them all, especially with large amounts of data.

4. Collect data using Clusteric (the amount of data depends on the chosen mode).
5. Clusteric will suggest an automatic rating and divide the links into several categories.
6. Audit the links according to your own criteria. Extract trusted domains and investigate spam profiles. Assign keywords and analyse content based on link profiles.
7. After the audit, generate a PDF report.

Automatically generated PDF reports on the surveyed link profile occupy about 35 A4 pages. They include descriptions of SEO issues, analysis of the current situation as well as charts and tag clouds.

You can download a sample SEO report from backlink analysis here:

Examples of report elements.

This report was created entirely automatically, based on data from the backlink database collected by Clusteric robots and the default data sampling.

If you need additional data in the report or you have an idea how to interpret them differently, please contact us.

Very powerful add-on to merge Excel files. https://clusteric.com/articles-tips-seo/very-powerful-tool-to-merge-excel-files/ https://clusteric.com/articles-tips-seo/very-powerful-tool-to-merge-excel-files/#comments Sat, 03 Mar 2018 10:52:47 +0000 https://clusteric.com/?p=6843 New, very powerful tool in the arsenal of “small SEO tools” to manage the process of merging Excel files (.xls /...

The post Very powerful add-on to merge Excel files. first appeared on Clusteric Auditor | SEO Audit.
A new, very powerful tool has joined the arsenal of “small SEO tools”: it manages the process of merging Excel files (.xls / .xlsx).

This tool has countless applications in combining and processing analytical data from various sources.

These range from combining Google Search Console data with Google Analytics data at the URL level to combining data from external sources with internal data.

A few example applications:

E-commerce and SEO.

Sales data at the level of a given product, combined with the quality and quantity of links leading to the subpage of this product.

E-commerce and customer satisfaction (NPS)
Analytics of customer satisfaction with the product or support on the chat combined with sales from individual URLs.

Advanced SEO.

Merged report: keywords (GSC), website speed (GA), sales (GA), margin (internal data), quality of subpage (PA) and position on given keywords (external data, position monitoring) for a given language.

Analysis of expired domains for the PBN.

The “Expired domains” export combined with data on quantity and keywords (titles) downloaded from WebArchive (an on-site crawl with individual parameters).

The possibilities of combining and analyzing data are unlimited.

How to work with the program and efficiently combine files?

1. Prepare the XLS files so that the column with the URL is first.
2. Make sure the files contain column names in the first row. Watch out for exports from Google Search Console: these usually contain a description of the export at the beginning of the file.
3. Make sure that there are no duplicates in the files.
4. If you use data with relative URLs, such as exports from Google Analytics (/product name.html), add the domain in the “Prefix” field.
5. Merge files and work on ordered data.
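The steps above boil down to a join on the URL column. As a rough Python sketch of that idea, the example below merges two small tables (represented as lists of dictionaries rather than real XLS files) and applies a prefix to relative URLs, as in step 4. The function names, the `url` column name and the sample data are assumptions for illustration only.

```python
def normalize_url(url, prefix=""):
    """Turn a relative URL such as /product.html into an absolute one."""
    return prefix + url if url.startswith("/") else url


def merge_by_url(tables, prefixes=None):
    """Merge rows from several tables into one row per URL.

    tables is a list of tables; each table is a list of dicts with a
    "url" key. prefixes holds one domain prefix per table ("" for none).
    """
    prefixes = prefixes or [""] * len(tables)
    merged = {}
    for rows, prefix in zip(tables, prefixes):
        for row in rows:
            url = normalize_url(row["url"], prefix)
            target = merged.setdefault(url, {"url": url})
            target.update({k: v for k, v in row.items() if k != "url"})
    return list(merged.values())


gsc = [{"url": "https://shop.example/product.html", "clicks": 120}]
ga = [{"url": "/product.html", "sessions": 300}]
report = merge_by_url([gsc, ga], ["", "https://shop.example"])
print(report)
```

If a table contained the same URL twice, the later row would overwrite the earlier one in this sketch, which is one reason step 3 asks you to remove duplicates first.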

Use XLS files to preserve data formatting.

Change log:

New small tool – merging xls / xlsx files.
Proxy server management – optimizations.
macOS version: problems with auditing sites using TLS have been fixed.
