Luffa Picture: Do you know the meaning of each IP address of Baidu search engine crawling spider?


Many experienced people in the Internet industry have analyzed Baidu Spider IPs in depth. Baidu is the largest search engine in China and holds about half of the search engine market. Its algorithm adjustments in recent months have left SEOs exhausted, and the names and details of each update show up across major websites and portals. The hot keywords in popular SEO communities are now Baidu algorithms, the return codes that spider visits leave in server logs, Baidu snapshot updates, Baidu demotions and penalties, and so on. One way to analyze the status of a website is to look at which Baidu spider IPs visit it. The following are common Baidu spider IPs:

123.125.68.* This spider visits frequently while the others visit less often. This suggests that the website may be about to enter the sandbox or be demoted.

220.181.68.* If visits from this segment only increase every day and never decrease, the site is very likely in the sandbox or has been removed from the index ("K'd").

220.181.7.* and 123.125.66.* These indicate an ordinary Baidu spider visit that is about to crawl your content.

121.14.89.* This segment is used for new sites during their probation period.

203.208.60.* This segment appears on new sites and after a site shows abnormal behavior.

210.72.225.* This segment patrols all sites continuously.

125.90.88.* This segment (Guangdong Maoming Telecom) also belongs to Baidu Spider. It mostly shows up on newly launched sites, or when webmaster tools or comprehensive SEO checks are run against the site.

220.181.108.95 This is Baidu's dedicated IP for crawling the homepage. If your visits come from the 220.181.108 segment, your website's snapshot will basically be updated overnight every day. I guarantee this is accurate.

220.181.108.92 Same as above: about 98% of its requests are for the homepage, and it may also crawl other pages (but not internal pages). The 220.181 segment is a high-weight IP segment. Articles or homepages crawled by this segment are basically released within 24 hours.

123.125.71.106 crawls internal pages and has a lower weight. Internal page articles crawled by this segment will not be released soon, because they are non-original or scraped articles.

220.181.108.91 is a general-purpose IP that mainly crawls homepages, internal pages, and other pages. It belongs to the high-weight IP segment, and the articles or homepages it crawls are basically released within 24 hours.

220.181.108.75 focuses on internal pages: about 90% of its requests are for internal pages with updated articles, 8% for the homepage, and 2% for other pages. It belongs to the high-weight IP segment, and the articles or homepages it crawls are basically released within 24 hours.

220.181.108.86 is a high-weight IP dedicated to crawling the homepage. It usually returns 304 0 0, which means the page has not been updated.

123.125.71.95 crawls internal pages and has a lower weight. Internal page articles crawled by this segment will not be released soon, because they are non-original or scraped articles.

123.125.71.97 crawls internal pages and has a lower weight. Internal page articles crawled by this segment will not be released soon, because they are non-original or scraped articles.

220.181.108.89 is a high-weight IP dedicated to crawling the homepage. It usually returns 304 0 0, which means the page has not been updated.

220.181.108.94 is a high-weight IP dedicated to crawling the homepage. It usually returns 304 0 0, which means the page has not been updated.

220.181.108.97 is a high-weight IP dedicated to crawling the homepage. It usually returns 304 0 0, which means the page has not been updated.

220.181.108.80 is a high-weight IP dedicated to crawling the homepage. It usually returns 304 0 0, which means the page has not been updated.

220.181.108.77 is a high-weight IP dedicated to crawling the homepage. It usually returns 304 0 0, which means the page has not been updated.

123.125.71.117 crawls internal pages and has a lower weight. Internal page articles crawled by this segment will not be released soon, because they are non-original or scraped articles.

220.181.108.83 is a high-weight IP dedicated to crawling the homepage. It usually returns 304 0 0, which means the page has not been updated.

Note: There are many more IPs like the ones above. IPs in the 123.125.71.* segment crawl internal pages and carry a lower weight. This may be because the articles are scraped or pieced together, so they are temporarily included in the index but not released (that is, they are pending).

The 220.181.108.* segment mainly crawls homepages (reportedly about 80% of its requests) and, less often, internal pages. Articles or homepages crawled by this segment will definitely be released within 24 hours, with the snapshot updated overnight. I can guarantee this!
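The segment rules above can be applied to a list of visitor IPs pulled from a server log. The following is a minimal sketch, assuming each `a.b.c.*` pattern covers the whole /24 network; the names `SEGMENT_NOTES`, `describe_ip`, and `count_by_segment` are hypothetical, and the notes simply restate this article's claims, which Baidu has not confirmed.

```python
import ipaddress
from collections import Counter

# Segment notes as claimed in this article (not confirmed by Baidu).
SEGMENT_NOTES = {
    "123.125.68.*": "frequent visits alone may signal sandbox or demotion",
    "220.181.68.*": "steadily growing visits may signal sandbox or removal",
    "220.181.7.*": "ordinary crawl",
    "123.125.66.*": "ordinary crawl",
    "121.14.89.*": "new-site probation",
    "203.208.60.*": "new or abnormal site",
    "210.72.225.*": "continuous patrol",
    "125.90.88.*": "newly launched sites or SEO tool checks",
    "220.181.108.*": "high weight, mostly homepage",
    "123.125.71.*": "low weight, internal pages",
}


def pattern_to_network(pattern):
    """Turn 'a.b.c.*' into the network a.b.c.0/24 (assumed meaning of '*')."""
    return ipaddress.ip_network(pattern.replace("*", "0") + "/24")


NETWORKS = [(pattern_to_network(p), p, note) for p, note in SEGMENT_NOTES.items()]


def describe_ip(ip):
    """Return (pattern, note) for a listed segment, or None if unlisted."""
    addr = ipaddress.ip_address(ip)
    for network, pattern, note in NETWORKS:
        if addr in network:
            return pattern, note
    return None


def count_by_segment(ips):
    """Tally visits per listed segment; unlisted IPs are counted as 'other'."""
    counts = Counter()
    for ip in ips:
        match = describe_ip(ip)
        counts[match[0] if match else "other"] += 1
    return counts


if __name__ == "__main__":
    visits = ["220.181.108.95", "220.181.108.92", "123.125.71.106"]
    for segment, hits in count_by_segment(visits).items():
        print(segment, hits)
```

Comparing these tallies from day to day is one way to spot the patterns described above, such as one segment visiting far more often than the rest.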

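Generally, the return code for a successful crawl is 200 0 0. The three numbers are the status, sub-status, and Win32 status fields as they appear in IIS logs. A return of 304 0 0 means the spider visited but the website has not been updated since its last visit. If you see 200 0 64, don't worry: this does not mean the site has been removed from the index (K'd). It may be that the website is dynamic, so this code is returned. Win32 status 64 means the network connection was no longer available, which typically indicates that the connection closed before the response finished.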
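To read these codes out of a log in bulk, a small helper is enough. This is a minimal sketch, assuming the three values have already been extracted from the log line as a string such as `"304 0 0"`; the function name `explain_return_code` is hypothetical, and the wording of each result follows this article's interpretation.

```python
def explain_return_code(triple):
    """Interpret a 'status substatus win32-status' string such as '304 0 0'."""
    status, substatus, win32 = (int(part) for part in triple.split())
    if status == 304:
        return "visited, not updated since the last crawl"
    if status == 200 and win32 == 0:
        return "crawled successfully"
    if status == 200 and win32 == 64:
        # Per the article: not a penalty; the connection likely closed early.
        return "served, but the connection closed early"
    return "other"


if __name__ == "__main__":
    for code in ("200 0 0", "304 0 0", "200 0 64"):
        print(code, "->", explain_return_code(code))
```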

The Baidu spider IPs above are shared for reference only. I hope SEOs will build standards-compliant websites and publish high-quality, valuable content. If you start from the user experience, I believe Baidu will not treat you unfairly. This article was itself compiled from material found on the Internet and revised before being published for SEO enthusiasts, and I hope reading it improves your professional skills. I also believe Baidu will not K my site because of it: we are serving webmasters, and this is a valuable article.
