Zwielicht
Sheriff
Administrator
Founding Member





- Joined
- Nov 10, 2024
- Messages
- 2,207
- Reaction Score
- 6,769
I've already posted about this on the Xenforo forums and my business website, but I'm posting it here as well as many of you many find this information useful. The bots I talk about don't just visit forums, but websites generally.
Before I continue, you can see how I syndicated the content here. I rewrite everything so that's it's not exactly the same rather than copy/paste it as is. Feel free to do the same here on Office Outlaw to grow your business sites!
Anyway, if you've kept up with the changelog here on Office Outlaw, you'll know that I made some changes yesterday to deal with the page load speed issue. As the forum has grown since March, we've been getting more bots visiting the site. At first, these bots we wanted like Googlebot and Bingbot, but as time went on, we started getting more undesirable bots that offered absolutely no benefit to us. In fact, these bots were visiting the forum so often, sometimes hundreds of times in a very short span of time, thus slowing down the website as it was like over a thousand people loading a page at once.
Frustrating, right? The worst part was that upon further research, some of these bots reportedly ignored robots.txt files. I ended up blocking them via Cloudflare, although if you don't use Cloudflare, you can block them via .htaccess or even a bot management tool if your web hosts has one like mine does.
Blocking via .htaccess example
Blocking via robots.txt example
If you're curious to know which bots I blocked and the reasons why, I've provided a list below:
Before I continue, you can see how I syndicated the content here. I rewrite everything so that's it's not exactly the same rather than copy/paste it as is. Feel free to do the same here on Office Outlaw to grow your business sites!
Anyway, if you've kept up with the changelog here on Office Outlaw, you'll know that I made some changes yesterday to deal with the page load speed issue. As the forum has grown since March, we've been getting more bots visiting the site. At first, these bots we wanted like Googlebot and Bingbot, but as time went on, we started getting more undesirable bots that offered absolutely no benefit to us. In fact, these bots were visiting the forum so often, sometimes hundreds of times in a very short span of time, thus slowing down the website as it was like over a thousand people loading a page at once.
Frustrating, right? The worst part was that upon further research, some of these bots reportedly ignored robots.txt files. I ended up blocking them via Cloudflare, although if you don't use Cloudflare, you can block them via .htaccess or even a bot management tool if your web hosts has one like mine does.
Blocking via .htaccess example
Code:
<IfModule mod_rewrite.c>
RewriteEngine On
RewriteCond %{HTTP_USER_AGENT} Rogerbot [NC]
RewriteRule ^ - [F,L]
</IfModule>
Blocking via robots.txt example
Code:
User-agent: Moz
Disallow: /
If you're curious to know which bots I blocked and the reasons why, I've provided a list below:
- Bytespider: By far the worst one. I found out many other Xenforo forum owners were dealing with this one swarming their forums and ignoring robots.txt files. It’s Bytedance AKA TikTok’s bot that trains their language learning models, which means there’s absolutely no benefit to you for allowing this one to crawl your website.
- Amazonbot: You can consider allowing this one if you’ve found people use Alexa with your site and you want to keep that feature working.
- Semrush: This one is useful if you’re using tool or if you plan on selling your website in the future as many in the industry feel they’re fairly accurate with collecting data. Since I don’t use the tool and I don’t plan on selling my forum, I’ve blocked it.
- Ahrefs: Same situation as before where it's only useful if you're using the tool.
- Mozlinkexplorer: Again, only useful if you're using the tool or plan to use it in the future.