How should you write your robots.txt file to rank higher in Baidu? Anyone who works with SEO knows that the robots.txt file in the site's root directory should be configured before the site goes live.

What is robots.txt? When Baidu's spider visits a website, it first checks whether a plain-text file called robots.txt exists under the site's root domain (a file the spider reads before crawling the site). This file tells the spider the boundaries of what it may crawl on your website. If you never create or edit robots.txt, the spider will crawl everything when it visits, including your admin backend and your JS and CSS files; in other words, your entire site is transparent to the spider.

What are the consequences of the backend being crawled? Some readers who are unfamiliar with this may ask. If the spider crawls your site's backend, the backend's URL may be indexed, and it can then surface in Baidu search results. The consequences are easy to imagine: someone with even modest hacking skills could find your backend login page in minutes. Isn't that scary?

The general format of robots.txt:

User-agent: * — names the crawler a rule block applies to, e.g. Baidu (Baiduspider), Google (Googlebot), 360 (360Spider); "*" means all crawlers.

Disallow: — forbids crawling and indexing of the given path. For example, if the backend directory is named dede and I don't want spiders to visit it, I would write: Disallow: /dede/

In path patterns, "/dede/" with a trailing slash is an exact directory match, while "/dede" without one is a broad prefix match; "$" matches the end of the URL, and "*" matches zero or more characters.

Allow: — permits crawling. It is usually omitted, since everything not disallowed is allowed by default, but you may write it when you have special requirements, such as carving an exception out of a Disallow rule.

# — starts a comment.

Going further: blocking directories from crawling. Suppose we want to block spiders from crawling the inc folder under the root directory and all of its contents, as well as the index.html file under the wap directory under the root.
How to write robots.txt:

User-agent: *
Disallow: /inc/ (prevents crawling of the contents of the inc folder)
Disallow: /wap/index.html (prevents crawling of the index.html file in the wap directory)

Blocking a directory but allowing certain files under it:

1. Block all spiders from the wap folder under the root directory, but allow them to crawl the .html files inside it. How to write robots.txt:

User-agent: *
Disallow: /wap/ (prevents crawling of the contents of the wap folder)
Allow: /wap/*.html (permits crawling of files with the .html suffix under wap)

2. Prevent crawling of all folders and files in the root directory whose names begin with "wap". Here we use the broad prefix-match form, without a trailing slash:

User-agent: *
Disallow: /wap

3. Protect private folders or files. Listing private folders explicitly to keep search engines out also reveals your directory structure, letting others guess at your site's backend system, admin location, and so on (so explicit listing is rarely used on normal sites). Instead, we can use the broad prefix form to protect important files. For example, to prevent crawling of /inli, we could write it as follows. The precondition, of course, is that no other folders or files in your root directory start with those characters and still need to be crawled:

User-agent: *
Disallow: /inli

Blocking dynamic URLs. Sometimes a dynamic page serves the same content as a static page, resulting in duplicate indexing (which hurts spider friendliness). To block dynamic URLs:

User-agent: *
Disallow: /*?*

To allow only URLs ending in ".html" to be crawled:

User-agent: *
Allow: /*.html$
Disallow: /

Blocking dead links. Besides submitting broken links to the Baidu Webmaster Platform, robots.txt can stop spiders from crawling them.
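The matching rules used in the examples above ("*" matches any run of characters, "$" anchors the pattern to the end of the URL, and a bare path is a prefix match) can be sketched in a few lines of Python. This is a minimal illustration of the wildcard semantics, not any crawler's actual implementation, and the patterns and paths below are made-up examples:

```python
import re

def rule_matches(pattern: str, path: str) -> bool:
    """Check whether a robots.txt path pattern matches a URL path.

    '*' matches zero or more characters, a trailing '$' anchors the
    pattern to the end of the path, and otherwise the pattern is a
    prefix match (the wildcard semantics described above).
    """
    anchored = pattern.endswith("$")
    if anchored:
        pattern = pattern[:-1]
    # Escape every literal character, turning each '*' into '.*'.
    regex = "".join(".*" if ch == "*" else re.escape(ch) for ch in pattern)
    if anchored:
        regex += "$"  # must reach the end of the path
    # re.match anchors at the start of the string, giving prefix semantics.
    return re.match(regex, path) is not None

# Disallow: /wap/ blocks everything inside the folder...
print(rule_matches("/wap/", "/wap/index.html"))        # True
# ...while Allow: /wap/*.html matches the .html files under it:
print(rule_matches("/wap/*.html", "/wap/news.html"))   # True
# Disallow: /*?* catches dynamic URLs:
print(rule_matches("/*?*", "/list.php?id=1"))          # True
# Allow: /*.html$ rejects paths with anything after ".html":
print(rule_matches("/*.html$", "/a.html?x=1"))         # False
# Disallow: /inli is a broad prefix match:
print(rule_matches("/inli", "/inli2020/secret.txt"))   # True
```

Note that different search engines resolve conflicts between Allow and Disallow rules differently (e.g. by longest match), so test against the engine you care about.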
The syntax is the same as above; it is best to give the complete path:

User-agent: *
Disallow: (full URL of the dead page)

Blocking individual links from passing Baidu ranking signals. For a page link that should not pass ranking weight, add a nofollow attribute directly to the link:

<a rel="nofollow" href="URL of the page">anchor text</a>

Where to place the sitemap reference in robots.txt: the best place for the sitemap (site map) line is at the bottom of robots.txt. Since, per the principle mentioned above, the spider reads robots.txt first, it will find the sitemap there right away.

Sitemap: (website URL)/sitemap.xml
Sitemap: (website URL)/sitemap.html
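Putting the pieces together, a robots.txt along these lines can be sanity-checked with Python's standard urllib.robotparser before going live. The domain, paths, and sitemap URL below are made-up placeholders. Two caveats: the standard-library parser only does simple prefix matching (no "*" or "$" wildcards), so only prefix rules are exercised here, and site_maps() requires Python 3.8+:

```python
from urllib.robotparser import RobotFileParser

# A hypothetical robots.txt combining prefix rules discussed above.
robots_txt = """\
User-agent: *
Disallow: /inc/
Disallow: /wap/index.html
Sitemap: https://example.com/sitemap.xml
"""

rp = RobotFileParser()
rp.parse(robots_txt.splitlines())

# Files under /inc/ are blocked for every crawler, Baiduspider included.
print(rp.can_fetch("Baiduspider", "https://example.com/inc/config.php"))  # False
# The home page is not covered by any Disallow rule, so it stays crawlable.
print(rp.can_fetch("Baiduspider", "https://example.com/index.html"))      # True
# The Sitemap line is picked up as well (Python 3.8+).
print(rp.site_maps())  # ['https://example.com/sitemap.xml']
```

Running a check like this against your draft robots.txt is a cheap way to catch typos (such as a doubled colon or a missing slash) before spiders see them.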