Luffa Picture: Do you know the meaning of each IP address of Baidu search engine crawling spider?

Luffa Picture: Do you know the meaning of each IP address of Baidu search engine crawling spider?

Many outstanding people in the Internet industry have conducted in-depth analysis of Baidu Spider IP, and it can be said that they have achieved a certain level. As the largest search engine in China, Baidu also occupies half of the search engine market. Baidu's algorithm adjustments in recent months have also made SEOs exhausted. Various names and updated algorithm methods will appear on major websites and portals. More keywords in major SEO popular communities have become Baidu algorithms, search engine spider crawling return log codes, Baidu snapshot updates, Baidu being demoted and punished, and so on. The author can analyze the status of the website based on different IPs. The following are common Baidu spider IPs:

123.125.68.* This spider comes frequently, while others come less frequently. This indicates that the website may need to enter the sandbox or be demoted.

220.181.68.* This IP segment only increases every day and never decreases. It is very likely to be in the sandbox or K station.

220.181.7.* and 123.125.66.* represent the Baidu spider IP visiting, ready to crawl your stuff.

The IP segment 121.14.89.* is used as a new site during its probation period.

203.208.60.* This IP segment appears in new sites and after sites have abnormal phenomena.

210.72.225.* This IP segment patrols each station non-stop.

125.90.88.* Guangdong Maoming Telecom also belongs to Baidu Spider IP. The main reason is that there are many newly launched sites, and there are also webmaster tools or SEO comprehensive detection.

220.181.108.95 This is the dedicated IP for Baidu to crawl the homepage. If it is in the 220.181.108 segment, basically your website will take snapshots overnight every day. It will be absolutely correct, I guarantee it.

220.181.108.92 Same as above, 98% crawls the homepage, and may also crawl other (not internal pages). The 220.181 segment belongs to the weighted IP segment. The articles or homepages crawled by this segment are basically released 24 hours a day.

123.125.71.106 crawls the internal pages and has a lower weight. The internal page articles crawled through this section will not be released soon because they are not original or collected articles.

220.181.108.91 is comprehensive, mainly crawling homepages and inner pages or others. It belongs to the weighted IP segment, and the crawled articles or homepages are released basically 24 hours a day.

220.181.108.75 focuses on crawling 90% of the inner pages of updated articles, 8% of the homepage, and 2% of others. For weighted IP segments, crawled articles or homepages are released basically 24 hours a day.

220.181.108.86 is dedicated to crawling the homepage IP weight segment. The general return code is 304 0 0, which means it has not been updated.

123.125.71.95 crawls the internal pages and has a lower weight. The internal page articles crawled through this section will not be released soon because they are not original or collected articles.

123.125.71.97 crawls the internal pages and has a lower weight. The internal page articles crawled through this section will not be released soon because they are not original or collected articles.

220.181.108.89 is dedicated to crawling the home page IP weight segment. The general return code is 304 0 0, which means it has not been updated.

220.181.108.94 is dedicated to crawling the home page IP weight segment. The general return code is 304 0 0, which means it has not been updated.

220.181.108.97 is dedicated to crawling the home page IP weight segment. The general return code is 304 0 0, which means it has not been updated.

220.181.108.80 is dedicated to crawling the homepage IP weight segment. The general return code is 304 0 0, which means it has not been updated.

220.181.108.77 is a dedicated IP weight segment for capturing homepage IPs. The general return code is 304 0 0, which means it has not been updated.

123.125.71.117 crawls the internal pages and has a lower weight. The internal page articles crawled through this section will not be released soon because they are not original or collected articles.

220.181.108.83 is dedicated to crawling the homepage IP weight segment. The general return code is 304 0 0, which means it has not been updated.

Note: There are many more IPs with the same last digit as above, but the IPs with the same segment 123.125.71.* represent a lower weight for crawling internal pages. This may be because the articles you collected or pieced together are temporarily included but not released. (That means it is pending).

The IP segment 220.181.108.* mainly crawls home pages, accounting for 80%, and internal pages, accounting for 30%. The articles or home pages crawled this time will definitely be released within 24 hours and overnight snapshots, I can guarantee this!

Generally, the return code for a successful crawl is 200 0 0. Returning 304 0 0 means that the website has not been updated and the spider has been there. If it is 200 0 64, don't worry, this is not a K site. It may be that the website is dynamic, so this code is returned.

The IP of Baidu crawler spider shared above is only for sharing with everyone. I hope SEO can build a standard website and publish high-quality and valuable content. From the user experience point of view, I believe Baidu will not treat you unfairly. The author of this article is also extracted from the Internet. After modification, it will be published to the majority of SEO enthusiasts for reading. I believe that your reading will improve your professional ability. I believe that Baidu will not K my site because of the article I extracted, because we are serving webmasters, this is a valuable article.

<<:  The essence of WeChat public account operation: from original to unique, from low price to priceless

>>:  8 ways to make your app icon stand out

Recommend

What is NetQin's intention in issuing another open letter to investors?

On the afternoon of August 8, NetQin once again i...

When is the best time to eat cookies and cakes to stabilize blood sugar?

I believe many people have this dilemma: they wan...

Threshold thinking in marketing!

There is an interesting story circulating on Wall...

Mantou New Media Operation Certificate Class

A millionaire trader will teach you the core skil...

64-bit phones will be everywhere next year: thanks to Android L

When Apple launched the iPhone 5s last year, many...

If we don’t speak the same language, will our thinking also be different?

If we don’t speak the same language, will our thi...

Second-class e-commerce advertising | 15 product cases in 7 categories!

The hot August gathers the enthusiasm of midsumme...

Every time a bunch of bananas is picked, a banana tree dies?

gossip "For every bunch of bananas picked, a...

This kind of paper straw will not soften when soaked. Would you like to try it?

Produced by: Science Popularization China Author:...

App download revenue: Apple takes 85%, while China Mobile only takes 15%

Yesterday, reporters learned from China Mobile tha...