How to check page indexing? Search index.

Website indexing in search engines is important for every webmaster. After all, for high-quality promotion of a project, you should monitor its indexing. I will describe the process of checking indexing in Yandex.

Indexing in Yandex

The Yandex robot scans sites day after day in search of something “tasty.” Collects in the top results those sites and pages that, in his opinion, most deserve it. Or maybe Yandex just wanted it that way, who knows?

We, as real webmasters, will adhere to the theory that the better the site is made, the higher its position and the more traffic.

There are several ways to check site indexing in Yandex:

  • using Yandex Webmaster;
  • using search engine operators;
  • using extensions and plugins;
  • using online services.

Indexing website pages in Yandex Webmaster

To understand what the search engine dug up on our site, you need to go to our beloved Yandex Webmaster in the “Indexing” section.

Bypass statistics in Yandex Webmaster

First, let’s go to the “Bypass Statistics” item. This section allows you to find out which pages of your site the robot crawls. You can identify addresses that the robot was unable to load due to the unavailability of the server on which the site is located, or due to errors in the content of the pages themselves.

The section contains information about the pages:

  • new - pages that recently appeared on the site or the robot has just crawled them;
  • changed - pages that the Yandex search engine previously saw, but they have changed;
  • crawl history - the number of pages that Yandex crawled, taking into account the server response code (200, 301, 404 and others).

The graph shows new ( green color) and changed ( Blue colour) pages.

And this is a graph of the crawl history.

This item displays the pages that Yandex found.

N/a — URL is not known to the robot, i.e. the robot had never met her before.

What conclusions can be drawn from the screenshot:

  1. Yandex did not find the address /xenforo/xenforostyles/, which, in fact, is logical, because this page no longer exists.
  2. Yandex found the address /bystrye-ssylki-v-yandex-webmaster/, which is also quite logical, because page is new.

So, in my case, Yandex Webmaster reflects what I expected to see: what is not needed, Yandex has removed, and what is needed, Yandex has added. This means that everything is fine with bypass, there are no blockages.

Pages in search

Search results are constantly changing - new sites are added, old ones are deleted, positions in search results are adjusted, and so on.

You can use the information in the “Pages in Search” section:

  • to track changes in the number of pages in Yandex;
  • to track added and excluded pages;
  • to find out the reasons for excluding a site from search results;
  • to obtain information about the date the search engine visited the site;
  • to receive information about changes in search results.

This section is needed to check the indexing of pages. Here Yandex Webmaster shows pages added to search results. If all your pages are added to the section (a new one will be added within a week), then everything is in order with the pages.

Checking the number of pages in the Yandex index using operators

In addition to Yandex Webmaster, you can check the indexing of a page using operators directly in the search itself.

We will use two operators:

  • “site” - search across all subdomains and pages of the specified site;
  • “host” - search for pages hosted on a given host.

Let's use the "site" operator. Note that there is no space between the operator and the site. 18 pages are in Yandex search.

Let's use the "host" operator. 19 pages indexed by Yandex.

Checking indexing using plugins and extensions

Check site indexing using services

There are a lot of such services. I'll show you two.

Serphunt

Serphunt is an online service for website analysis. They have useful tool to check page indexing.

You can simultaneously check up to 100 website pages using two search engines - Yandex and Google.

Click “Start scanning” and after a few seconds we get the result:


Total

From the author

The goal of the theory and practice of SEO is to get site pages into search results (indexing) and rise in the results of the promoted page. key query. When promoting a website, you need to have at hand, simple and available tools checking which website pages are indexed and which are not. In this article I will show you how to view the number of indexed pages in Yandex. How to check page indexing in Google.

Total volume of pages indexed

The indexing situation can be called ideal if the number of site pages open to search engines coincides with the number of pages in the index.

This means that everything created pages The sites are sufficiently informative and have attracted the interest of search engines for their usefulness.

You need to understand that indexing a page is only the first step, after which you need to promote it in search results. However, with a successful choice of the frequency of the key and its competition, the page will immediately get to the TOP and all that remains is to maintain it there.

As I said, the ideal option is if everything significant pages the site was included in the index. In this case, the number of indexed pages must exactly match the promoted pages. Situations where there are significantly fewer or more pages in the index than pages on the site require urgent correction.

1. If there are significantly fewer pages in the index than pages on the site, it is obvious that you are losing traffic and doing something wrong. Either the pages are not informative, or the content is not unique, or the pages are simply stolen from you and indexed faster on another website. 2. The situation when there are more pages in the index than pages on the site is no better. This means that search engines index duplicate pages or the site does not hide low-information and technical pages from search engines.

Both situations, insufficient and excessive indexing, interfere with website promotion and require study and correction.

To compare the number of pages on a site and the number of pages in the index, you need to know these quantities and be able to quickly see the number of indexed pages in Yandex.

How many pages are there on your site?

At the site creation stage, you had to decide which site material to show to search engines, and which to hide from crawling and indexing.

To control the indexing of pages in Yandex, the directives of the robots.txt file work perfectly. It is the correct filling of the section for the main Yandex bot, User-agent: Yandex, that should become the basis for managing Yandex indexing.

Find out the total number of site pages that Yandex “sees” on any Sitemap (site map) generator by checking the “take into account robots.txt directives” setting. I recommend or.

The number of created site pages can be viewed in administrative panel website on the materials or products page.

It remains to compare the two obtained values ​​with the number of indexed Yandex pages. There are several ways to do this.

How to see the number of indexed pages in Yandex

Method 1. Yandex webmaster

  • Log in (create) your account on Yandex Webmaster. https://webmaster.yandex.ru/
  • Look how many pages you have in search.

Method 2: Browser extensions

Every browser has extensions that show basic or advanced SEO data on the site, including the number of indexed pages in Yandex. Here is one of them, called "RDS bar".

  • for Google()
  • for Mozilla()
  • for Opera()

Method 3. Yandex search query syntax

  • Enter Yandex search (https://ya.ru/);
  • IN address bar enter the search string: host: www.domen.ru | host:domain.ru ;
  • Look at the search result.

All Yandex Query Language

If there are problems with indexing, first of all you need to check robots.txt and sitemap.xml.

Any search engine has a large database where it lists all sites and new pages. This base is called an "index". Until the robot crawls the HTML document, analyzes it and adds it to the index, it will not appear in search results. It will be possible to access it only through a link.

What does "indexing" mean?

No one can tell you about this better than Yandex’s indexing specialist:

Indexing is a process during which a search robot crawls a site’s pages and includes (or does not include) these pages in the search engine index. The search bot scans all content, conducts semantic analysis of text content, the quality of links, audio and video files. Based on all this, the search engine draws conclusions and puts the site in the ranking.

While the site is out of the index, no one will know about it, except those to whom you can distribute direct links. That is, the resource is available for viewing, but search engine he's not there.

Why do you need an index?

The site must be visible in order to promote, grow and develop. A web resource that does not appear in any PS is useless and does not benefit either users or its owner.

In general, here is the full video from the Yandex webmaster school; if you watch it in full, you will become practically an expert in the issue of indexing:

What does indexing speed depend on?

The main points that determine how quickly your site can get into the spotlight search robots:

  • Domain age (the older Domain name, the more bots are favorable to him).
  • Hosting (PS do not like free hosting at all and often ignore it).
  • CMS, code cleanliness and validity.
  • Page refresh speed.

What is a crawl budget?

Each site has a crawling budget - that is, the number of pages beyond which it cannot be included in the index. If the site’s KB is 1000 pages, then even if you have ten thousand of them, there will only be a thousand in the index. The size of this budget depends on how authoritative and useful your site is. And if you have a problem of such a nature that pages do not fall into the index, then as an option, you need, no matter how trivial it may sound, to improve the site!

Site indexing

When creating a new website, you need to correctly fill out the robots.txt file, which tells search engines whether the resource can be indexed, which pages to crawl and which ones not to touch.

The file is created in txt format and is placed in the root folder of the site. Proper robots is a separate issue. This file primarily determines what and how bots will analyze on your site.

Typically, it takes search engines from a couple of weeks to a couple of months to evaluate a new site and enter it into the database.

Spiders carefully scan every allowed HTML document, determining the appropriate topic for a new young resource. This action is not carried out in one day. With each new bypass, the PS will introduce more and more larger number html documents to your database. Moreover, from time to time the content will be re-evaluated, as a result of which the positions of pages in search results may change.

They also help manage indexing robots meta tag and partly canonical. When checking the structure and solving problems with indexing, you should always look for their presence.

Google indexes pages first top level. When a new site with a specific structure should be indexed, the first to be indexed is home page. After this, without knowing the structure of the site, the search engine will index what is closest to the slash. Later, directories with two slashes are indexed. This means that even if the links in the content are high, they will not necessarily be indexed first. It is important to create an optimal structure so that important sections were not behind big amount slashes, otherwise Google will think that this is a low-level page.

Page indexing

When Yandex and Google had already become acquainted with the site and “adopted” it into their search database, bots will return to the resource to scan new, added materials. The more frequently and regularly the content is updated, the more closely the spiders will monitor it.

They say that the PDS pinger plugin for Yandex search helps for indexing - https://site.yandex.ru/cms-plugins/. To do this, you first need to install Yandex search on your website. But I didn’t feel much benefit from it.

When a resource is well indexed, it is much easier to display individual, new pages in the search. But nevertheless, the analysis does not always occur uniformly and at the same speed for all simultaneously updated html documents. The most visited and promoted categories of the resource always win.

What sources of information do search engines have about URLs?

Once upon a time, I hired a quick robot to work on a competitor who had not renewed his domain, so that he would be lowered in the search results - this did not give any result.

How to check indexing

Visibility check html documents carried out differently for Google and Yandex. But in general there is nothing complicated. Even a beginner can do this.

Verification in Yandex

The system offers three main operators that allow you to check how many HTML documents are in the index.

The “site:” operator shows absolutely all resource pages that are already in the database.

Entered into the search bar as follows: site:site

The “host:” operator allows you to see indexed pages from domains and subdomains within the hosting.

Entered into the search bar as follows: host:site

The “url:” operator – shows the specific page requested.

Entered into the search bar as follows: url:site/obo-mne

Checking indexing with these commands always gives accurate results and is the most in a simple way resource visibility analysis.

Google check

PS Google allows you to check the visibility of a site using only one command like site:site.

But Google has one peculiarity: it processes commands differently with and without www entered. Yandex does not make such a distinction and gives absolutely the same results, both with and without registered www.

Checking by operators is the most “old-fashioned” method, but for these purposes I use the RDS Bar browser plugin.

Verification with Webmaster

IN Google services Webmaster and Yandex Webmaster you can also see how many pages are in the PS database. To do this, you need to be registered in these systems and add your website to them. You can access them using the following links:

http://webmaster.yandex.ru/ - for Yandex.

https://www.google.com/webmasters/- for Google.

If the text is not yet in the saved copy, but is on the page, then it can be found by searching [this text] url:site.ru - this will mean that it has already been indexed, but has not yet entered the main index

Bulk checking of pages for indexing

If you run, then checking all pages for indexing is a matter of three minutes.

  1. Go to the distribution file
  2. Select all URLs in the URL column
  3. “Data” tab – “Remove duplicates”, this will leave a list of all promoted pages
  4. We massively check pages for indexing using Comparser. You can also use the Winka browser plugin - it can work with a list of links in isolation from Sapa (call the plugin menu - check the list of links).

Is it possible to speed up indexing?

You can influence the speed of loading HTML documents by search robots. To do this, you should adhere to the following recommendations:

  • Increase the number of social signals by encouraging users to share links in their profiles. Or you can take tweets from live accounts in Prospero (klout 50+). If you create your own Twitter whitelist, consider that you have received a powerful weapon to speed up indexing;
  • Add new materials more often;
  • You can start spinning Direct for the cheapest queries in your topic;
  • Enter address new page in Addurilki immediately after its publication.

High behavioral factors on the site also have a positive effect on the speed of page updating in search. Therefore, do not forget about the quality and usefulness of content for people. A site that users really like will definitely like search robots.

In general, everything is very easy in Google - you can add a page to the index within a few minutes by scanning it in the webmaster panel (item crawl/view as Googlebot/add to index). In the same way, you can quickly reindex the necessary pages.

I also heard stories about guys who sent URLs via Yandex mail so that they would get into the index faster. In my opinion, this is nonsense.

If there is a real problem, that's all previous tips did not help, it remains to move on to heavy artillery.

  • We configure the Last-modified headers (so that the robot checks for updates only documents that have actually changed since its last call);
  • We remove garbage from the search engine index (this garbage can be found using Comparser);
  • We hide all unnecessary/junk documents from the robot;
  • Let's do additional files Sitemap.xml. Usually robots read up to 50,000 pages from this file, if you have more pages, you need to make more sitemaps;
  • Setting up the server.

Good day, dear friends. Many novice webmasters who independently promote their sites do not pay enough attention to indexing their resource. This leads to loss of time and money spent on creating an ineffective Internet platform that search engines and, accordingly, users and advertisers do not like.

Therefore, today, continuing the series of articles on website building, we will talk about how to check whether the site as a whole and its individual pages in particular are indexed, and we will also discuss how and why you need to speed up the indexing process.

We have already said more than once that it is possible only if high level traffic, interesting to potential advertisers.

Most users get to a certain resource from search engines by entering queries that interest them. Search robots compare these queries with their database and output optimal results search. In order for a site to be included in this database, it must be indexed by a search engine. Otherwise, visitors simply will not be able to find it.

At the same time, it is very important that not just a resource, but each of its new page was taken into account by the search engine as quickly as possible. It is optimal if it contains internal linking, which allows you not only to enter into the database new material, but also update the old one using the links provided.

Why should indexing be fast?

In addition to increasing visitor traffic, indexing speed also affects many other resource indicators.

Every day new sites appear on the Internet, the topics of which compete with your web site. All of them are filled with similar content, which, as the number of competitors grows, loses its uniqueness. This happens because most sites publish numerous rewrites. In simple terms if you wrote unique article and they didn’t immediately take care of it being taken into account by the search engine; it’s not a fact that at the time of doing so, the material will remain unique.

In addition, unindexed content becomes a tasty target for scammers. Nothing prevents an unscrupulous webmaster from simply copying the material to his resource, carrying out quick indexing and obtaining the right to authorship from search robots. And search engines will later consider your article not unique, which can lead to a ban on the Internet site. Therefore, controlling and speeding up the indexing process is especially important for young resources making their way.

Another point that depends on fast indexing of each page is the ability to receive money for paid links. After all, until the article with the link is indexed by search engines, you will not receive your reward.

How to check if the site as a whole is indexed?

First, you should make sure that your site is included in the search engine database. To do this you need to find out total number its pages. In the presence of modern system web resource management, view this figure possible in the administrative part. In this case, the total number of pages and records is taken into account.


If for any reason this information is not available, you can use the Xml-sitemaps.com service. Please note that it is free only when working with sites that have up to five thousand pages in their arsenal.

Having found out the required number, you can start checking the indexing of the site in the main search engines - Yandex and Google. There are several ways to do this:

  • Using special tools for webmasters: webmaster.yandex.ru And google.com/webmasters . By registering with them and adding your resource to the system, you will have access to not only data on the number of indexed pages, but also statistics of other “bellies”.
  • Manual check via input special teams to the search bar. In this case, in Yandex you need to enter the construction host: site name + domain or host: www + site name + domain, for example, host: abc.ru. For this, the system will display all indexed pages. To check in Google you will need to enter the query: site: site name + domain, i.e. site:abc.ru.
  • Usage automatic services, checking indexing in both search engines at once. These include, for example, Site-auditor.ru, Pr-cy.ru or Seolib.ru. You can also add the RDS Bar plugin to your browser, which will show information about the resource, including indexing of the pages you are on.

Using any of these methods, you can find out whether the site as a whole is indexed and determine the number of resource pages included in the search engine database.

What to do with the information received?


Ideally, the number of site pages should match the number of indexed pages. Unfortunately, this is not always the case. Two scenarios are much more common:

  • The indexed number of elements is less. Accordingly, you lose a lot in traffic, because for many user requests your site remains inaccessible to them.
  • The number of indexed pages exceeds the actual number of such pages. If you have a similar option, you shouldn’t be happy. Most likely, there is duplication of pages, which dilutes their weight, increases the number of repeated material and interferes with the promotion of the resource.

Both problems need to be resolved as quickly as possible. Otherwise, you risk getting an ineffective web platform, on which you can only make money in your dreams. And to do this, you will have to check the indexing of all pages separately to find out which of them were “rotated”.

How to check the indexing of individual or all pages

Checking a separate page is needed when you need to make sure that new published content is successfully “noticed” by search engines. Or when you purchased a paid link on someone else’s resource and are now looking forward to its indexing. This can be done via:

  • Enter Page URLs s in the Yandex or Google search bar. If there are no problems with the perception of the page by search engines, it will be displayed first in the search results.
  • The already mentioned RDS Bar plugin.

To check the indexing of all site pages, you will need a list of their addresses (URL). To do this, you can use any web resource map generator, for example, Sitemap Generator. To collect only page URLs, do not forget to add a mask of unnecessary addresses, for example, for comments, in the “Exclude Patterns” window. At the end of the process, you should go to the Yahoo Map/Text tab, from where you can copy the generated list of all addresses.

Having it in hand, it will not be difficult to check the indexing of all pages using the program YCCY.ru. Simply add data to the list of source URLs and select one of the suggested search engines: Google, Yandex or Rambler. Click the “Start Test” button and get satisfactory or not so satisfying results.

How to improve and speed up the indexing process?


Having learned the list of unindexed pages, you need to understand the reasons for this. First of all, it is worth checking the quality of the hosting and the web site itself and making sure that the materials posted are unique. Next, monitor the resource for content that is too short (up to 2,000 characters without spaces), containing more than 2-3 links to third party resources, or a lot of Java and Flash links. All these factors can primarily influence the fact that your material remains “invisible” to search engines.

You can speed up the site indexing process by using:

  • frequent updates of unique material, which is greatly appreciated by search engines;
  • competent internal page layout, allowing search engines to see new and update content already contained in the database;
  • publishing links to articles in all in social networks and thematic forums;
  • purchasing links from a boosted account.

I hope you understand that fast indexing resource pages are the basis for its promotion in search engines, on which your potential income directly depends.

Hello everyone, friends!
In today's article I will write about how to check site indexing and separate page in Yandex and Google. In addition, you will learn what a primary and secondary index is in a search engine. Google system. So, let's talk about everything in order.

How to check site indexing in Yandex?
In order to check the indexing of an entire site in Yandex, just enter this address in search bar:

url:www.yourdomain* | url:your domain*

In the search results you can find out how many pages are in the Yandex index, look at and. In addition, using such a request you can see which images Yandex indexes. To do this, just click on the link on the right: “All pictures”:

IN in this case It can be seen that the search engine indexes 83 documents.

By using RDS add-ons Bar you can also see page indexing and more. This extension allows you to learn a lot useful information about any site, all you need to do is just install it on your browser. I wrote in detail where to download and how to use the RDS Bar plugin.

Here, opposite the inscription “Index I” there is a number that is responsible for the number of indexed documents in Yandex. You can click on this number and see all the pages in the results, as in the previous case.

Yandex Webmaster also displays information about site indexing. There you can find out how many pages are prohibited from indexing in, and how many are indexed. But the problem is that the indexing of the resource is shown there with a slight delay. For example, I know that Yandex has now indexed 83 pages, but only 77 are displayed in Webmaster: smile::

But it's still very useful service and you definitely need to go there.

How to check the indexing of a page in Yandex?
Sometimes there is a need to check not the site’s indexing, but specific page. To do this, just enter the following query into the Yandex search bar:

site:address of any page

If the page is indexed, you will see it in the search results:

If it is not indexed, it will write: “The required combination of words is not found anywhere.”

In addition, the indexing of a specific page can be checked using the RDS Bar add-on. If the page is indexed, then opposite the inscription “Index I page.” will write “yes”, when it is not indexed, then accordingly “no”. If the document was indexed recently, the indexing time will be displayed, for example: “yesterday”, “16 hours ago”, etc. This way you can determine:

How to check site indexing in Google?
Before moving on to checking the indexing of a resource in Google, you need to understand that this search engine has two indexes: “main” and “additional”.

Only quality documents, which participate in the ranking.

The additional index contains low-quality pages that appear in search results very rarely. Well, for example, when a user enters some strange query, and there is no answer to it in the main index: smile:. IN additional search Google enters non-unique pages, documents prohibited in the robots.txt file, etc.

Now, using my blog as an example, I will show how you can find out how many documents are in the main search and how many are in the additional search.

First, let's find out how many documents Google indexes. To do this, I enter the following query into the search bar:

As you can see, there are 423 pages in the Google index.

Now we need to find out how many documents are in the main index. To do this, I enter the following query:

There are 108 documents in the main index. Now let's find out how many pages are in the additional index. And it's very easy to do. You need to subtract all the pages that Google indexes from those that are in the main search. In this case, 423 – 108 = 315.

So, 315 pages are “snot” that do not participate in ranking. For some reason, Google included there those documents that are prohibited from indexing in robots.txt. I don’t know why, but there’s nothing wrong with that, the main thing is that the main search contains basic documents that are not prohibited from indexing.

You can also check site indexing in Google using the RDS Bar add-on:

Here we are shown on the left how many pages Google indexes, and on the right what percentage of these documents are included in the main search. But the data may also be slightly inaccurate.

You can also check the indexing of an entire resource in Google using Google tool Webmaster. First, of course, you need to add a site there if it is not there. Then go to the section " Google Index» -> “Indexing status” and select “Extended data”:

Here you can see how many pages are indexed in total and how many are blocked in Robots.txt. But the data is displayed with a delay, so it is not always correct: smile:.

To check indexing separate document in the Google search engine, as in the case of Yandex, just enter the following query into the line:

In addition, RDS Bar also determines page indexing:

As you can see, everything is very simple, if you install the RDS Bar plugin on your browser, you can find out in a second the indexing of a site or page in Yandex and Google.

I guess I'll end here. Bye everyone ;-).