<?xml version="1.0" encoding="UTF-8"?>
<rss xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" xmlns:taxo="http://purl.org/rss/1.0/modules/taxonomy/" version="2.0">
  <channel>
    <title>topic Re: Is Hostopia blocking bingbot web crawler intentionally? in Everything else</title>
    <link>https://community.plus.net/t5/Everything-else/Is-Hostopia-blocking-bingbot-web-crawler-intentionally/m-p/1895190#M29894</link>
    <description>&lt;BLOCKQUOTE&gt;&lt;HR /&gt;&lt;a href="https://community.plus.net/t5/user/viewprofilepage/user-id/8134"&gt;@jtonline&lt;/a&gt;&amp;nbsp;wrote:
&lt;P&gt;My thoughts are that Hostopia is blocking the bingbot web crawler robot at server level, or they're only allowing bingbot from a set range of IP addresses that are out of date.&lt;/P&gt;
&lt;HR /&gt;&lt;/BLOCKQUOTE&gt;
&lt;P&gt;Well, if it's the latter, I imagine you're out of luck, given that Bing doesn't seem to publish what these IPs are. From &lt;A href="https://www.bing.com/webmasters/help/which-crawlers-does-bing-use-8c184ec0" target="_blank" rel="noopener"&gt;here&lt;/A&gt;:&lt;/P&gt;
&lt;P&gt;&lt;EM&gt;"You can identify Bing crawlers with the user agent string. But user agent strings are easy to spoof, so not every request with these user agent strings may be coming from a real Bing crawler. As a rule, &lt;FONT color="#FF0000"&gt;Bing does not share the IP addresses from which we crawl the web&lt;/FONT&gt;, but you can always use the&amp;nbsp;&lt;A href="https://www.bing.com/webmasters/verifybingbot" target="_blank"&gt;Verify Bingbot&lt;/A&gt;&amp;nbsp;tool to check if a crawler belongs to Bing."&lt;/EM&gt;&lt;/P&gt;
&lt;P&gt;I can try asking the question of our webhost partner. Assume it's the main 'Bingbot' crawler that you're concerned with?&lt;/P&gt;
&lt;P&gt;My suspicion is that the crawling is being blocked by a Web Application Firewall but that's pure conjecture at this point.&lt;/P&gt;</description>
    <pubDate>Fri, 28 Oct 2022 10:47:32 GMT</pubDate>
    <dc:creator>bobpullen</dc:creator>
    <dc:date>2022-10-28T10:47:32Z</dc:date>
    <item>
      <title>Is Hostopia blocking bingbot web crawler intentionally?</title>
      <link>https://community.plus.net/t5/Everything-else/Is-Hostopia-blocking-bingbot-web-crawler-intentionally/m-p/1895153#M29893</link>
      <description>&lt;P&gt;I can't get my little Plusnet website crawled, and therefore indexed, by the Bing search engine.&lt;BR /&gt;&lt;BR /&gt;Bing Webmaster Tools says it can't find my robots.txt file, and yet I can browse to it and Googlebot can see it.&lt;BR /&gt;&lt;BR /&gt;Sitemaps submitted via Webmaster Tools come back with a 403 error, and yet Googlebot can see and validate them OK.&lt;BR /&gt;&lt;BR /&gt;Inspecting a URL in Webmaster Tools comes back with "Discovered but not crawled".&lt;BR /&gt;&lt;BR /&gt;My thoughts are that Hostopia is blocking the bingbot web crawler robot at server level, or they're only allowing bingbot from a set range of IP addresses that are out of date.&lt;BR /&gt;&lt;BR /&gt;Can anyone confirm if they're experiencing the same issue?&lt;BR /&gt;&lt;BR /&gt;&lt;a href="https://community.plus.net/t5/user/viewprofilepage/user-id/14"&gt;@bobpullen&lt;/a&gt;&amp;nbsp;are you able to ask your contacts at Hostopia on my behalf?&lt;BR /&gt;&lt;BR /&gt;My website is &lt;A href="http://www.jtonline.info" target="_blank"&gt;http://www.jtonline.info&lt;/A&gt;. I'm going to start adding some new content, and Bing search results would be nice.&lt;/P&gt;</description>
      <pubDate>Fri, 28 Oct 2022 02:15:17 GMT</pubDate>
      <guid>https://community.plus.net/t5/Everything-else/Is-Hostopia-blocking-bingbot-web-crawler-intentionally/m-p/1895153#M29893</guid>
      <dc:creator>jtonline</dc:creator>
      <dc:date>2022-10-28T02:15:17Z</dc:date>
    </item>
    <item>
      <title>Re: Is Hostopia blocking bingbot web crawler intentionally?</title>
      <link>https://community.plus.net/t5/Everything-else/Is-Hostopia-blocking-bingbot-web-crawler-intentionally/m-p/1895190#M29894</link>
      <description>&lt;BLOCKQUOTE&gt;&lt;HR /&gt;&lt;a href="https://community.plus.net/t5/user/viewprofilepage/user-id/8134"&gt;@jtonline&lt;/a&gt;&amp;nbsp;wrote:
&lt;P&gt;My thoughts are that Hostopia is blocking the bingbot web crawler robot at server level, or they're only allowing bingbot from a set range of IP addresses that are out of date.&lt;/P&gt;
&lt;HR /&gt;&lt;/BLOCKQUOTE&gt;
&lt;P&gt;Well, if it's the latter, I imagine you're out of luck, given that Bing doesn't seem to publish what these IPs are. From &lt;A href="https://www.bing.com/webmasters/help/which-crawlers-does-bing-use-8c184ec0" target="_blank" rel="noopener"&gt;here&lt;/A&gt;:&lt;/P&gt;
&lt;P&gt;&lt;EM&gt;"You can identify Bing crawlers with the user agent string. But user agent strings are easy to spoof, so not every request with these user agent strings may be coming from a real Bing crawler. As a rule, &lt;FONT color="#FF0000"&gt;Bing does not share the IP addresses from which we crawl the web&lt;/FONT&gt;, but you can always use the&amp;nbsp;&lt;A href="https://www.bing.com/webmasters/verifybingbot" target="_blank"&gt;Verify Bingbot&lt;/A&gt;&amp;nbsp;tool to check if a crawler belongs to Bing."&lt;/EM&gt;&lt;/P&gt;
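&lt;P&gt;As a rough local alternative to the Verify Bingbot tool mentioned in that quote, here is a minimal sketch of a reverse-then-forward DNS check for an address found in the server logs. It rests on an assumption that is not stated anywhere in this thread: that genuine Bingbot addresses reverse-resolve to hostnames under search.msn.com. The names BING_SUFFIX, is_bing_hostname and verify_bingbot are made up for illustration, and the forward lookup used here only covers IPv4.&lt;/P&gt;

```python
import socket

# Assumption (not from this thread): real Bingbot IPs have PTR records
# that end in this suffix.
BING_SUFFIX = ".search.msn.com"


def is_bing_hostname(hostname):
    """Return True when the hostname sits under the assumed Bing crawler domain."""
    return hostname.lower().rstrip(".").endswith(BING_SUFFIX)


def verify_bingbot(ip, reverse=socket.gethostbyaddr, forward=socket.gethostbyname_ex):
    """Reverse-resolve the IP, check the domain, then forward-confirm the hostname.

    The resolver functions are parameters so the logic can be exercised
    without touching the network.
    """
    try:
        hostname = reverse(ip)[0]
    except OSError:
        return False  # no PTR record, so the claim cannot be confirmed
    if not is_bing_hostname(hostname):
        return False
    try:
        addresses = forward(hostname)[2]
    except OSError:
        return False
    # The hostname must map back to the same IP, otherwise the PTR could be forged.
    return ip in addresses
```

&lt;P&gt;Calling verify_bingbot with an IP taken from the access logs would show whether a request carrying the Bingbot user agent plausibly came from Bing. It says nothing about why the host returns a 403 to the crawler.&lt;/P&gt;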
&lt;P&gt;I can try asking the question of our webhost partner. Assume it's the main 'Bingbot' crawler that you're concerned with?&lt;/P&gt;
&lt;P&gt;My suspicion is that the crawling is being blocked by a Web Application Firewall but that's pure conjecture at this point.&lt;/P&gt;</description>
      <pubDate>Fri, 28 Oct 2022 10:47:32 GMT</pubDate>
      <guid>https://community.plus.net/t5/Everything-else/Is-Hostopia-blocking-bingbot-web-crawler-intentionally/m-p/1895190#M29894</guid>
      <dc:creator>bobpullen</dc:creator>
      <dc:date>2022-10-28T10:47:32Z</dc:date>
    </item>
    <item>
      <title>Re: Is Hostopia blocking bingbot web crawler intentionally?</title>
      <link>https://community.plus.net/t5/Everything-else/Is-Hostopia-blocking-bingbot-web-crawler-intentionally/m-p/1895216#M29895</link>
      <description>&lt;BLOCKQUOTE&gt;&lt;HR /&gt;&lt;a href="https://community.plus.net/t5/user/viewprofilepage/user-id/14"&gt;@bobpullen&lt;/a&gt;&amp;nbsp;wrote:&lt;BR /&gt;
&lt;P&gt;I can try asking the question of our webhost partner. Assume it's the main 'Bingbot' crawler that you're concerned with?&lt;/P&gt;
&lt;P&gt;My suspicion is that the crawling is being blocked by a Web Application Firewall but that's pure conjecture at this point.&lt;/P&gt;
&lt;HR /&gt;&lt;/BLOCKQUOTE&gt;
&lt;P&gt;Thanks for your reply, Bob.&lt;BR /&gt;If you could ask the question, that would be great. Yes, Bingbot seems to be the main one.&lt;BR /&gt;I've also contacted the Bing Webmaster Tools support team via their contact form and am awaiting a reply, but I suspect they're going to say 'contact the webhost'.&lt;/P&gt;</description>
      <pubDate>Fri, 28 Oct 2022 13:27:44 GMT</pubDate>
      <guid>https://community.plus.net/t5/Everything-else/Is-Hostopia-blocking-bingbot-web-crawler-intentionally/m-p/1895216#M29895</guid>
      <dc:creator>jtonline</dc:creator>
      <dc:date>2022-10-28T13:27:44Z</dc:date>
    </item>
  </channel>
</rss>

