<div dir="auto"><span style="font-family:sans-serif">Further to the original post, as well as not creating a DNS record and "possibly" adding robot.txt with appropriate content, as discussed, I presume that if I run the http server on a personally selected unprivileged port then it is very "unlikely" the site pages will be indexed/discovered/etc surely?</span><div dir="auto" style="font-family:sans-serif"><br></div><div dir="auto" style="font-family:sans-serif">Thoughts?</div><div dir="auto" style="font-family:sans-serif"><br></div><div dir="auto" style="font-family:sans-serif">Thanks.</div><br></div><br><div class="gmail_quote"><div dir="ltr">On Sun, Oct 21, 2018, 20:32 N6ghost <<a href="mailto:n6ghost@gmail.com">n6ghost@gmail.com</a>> wrote:<br></div><blockquote class="gmail_quote" style="margin:0 0 0 .8ex;border-left:1px #ccc solid;padding-left:1ex">On Thu, 11 Oct 2018 15:39:55 -0400<br>
Barry Margolin <<a href="mailto:barmar@alum.mit.edu" target="_blank" rel="noreferrer">barmar@alum.mit.edu</a>> wrote:<br>
<br>
> In article <<a href="mailto:mailman.671.1539286015.803.bind-users@lists.isc.org" target="_blank" rel="noreferrer">mailman.671.1539286015.803.bind-users@lists.isc.org</a>>,<br>
> Dennis Clarke <<a href="mailto:dclarke@blastwave.org" target="_blank" rel="noreferrer">dclarke@blastwave.org</a>> wrote:<br>
> <br>
> > On 10/11/2018 03:21 PM, Leonardo Rodrigues wrote: <br>
> > > Em 11/10/18 16:13, Barry Margolin escreveu: <br>
> > >><br>
> > >> If you accidentally, or someone else intentionally, create a<br>
> > >> link to the site that uses the IP and put it on a web page that<br>
> > >> Google can get to, it will probably find the page.<br>
> > >><br>
> > >> <br>
> > > <br>
> > > robots.txt, on your website root, is your friend. Simply<br>
> > > deny web crawling on it, and you're (probably) done.<br>
> > > <br>
> > <br>
> > If you believe robots.txt means anything at all. <br>
> <br>
> Google is known to obey it, and the question was about avoiding<br>
> getting your site indexed by Google.<br>
> <br>
> Of course, that doesn't mean someone won't find the site on their<br>
> own. If the link to it is on some other page that isn't blocked by <br>
> robots.txt, someone might stuble across that page and then click on<br>
> the link.<br>
> <br>
> But if you're mainly worried about someone googling the words that<br>
> are on your website and Google sending them to the development<br>
> version instead of the production version, you're pretty safe.<br>
> <br>
> Actually, DNS has very little impact on this at all. AFAIK, Google <br>
> doesn't crawl DNS, it just crawls web pages and follows links. My <br>
> company's development server is in DNS, and it's not firewalled (we<br>
> all work from our homes, there's no company network to restrict<br>
> access with), but I've never heard of anyone accidentally being<br>
> directed there by Google, because we don't publish links to this<br>
> server.<br>
> <br>
<br>
robot.txt is suppose to govern whats indexed... not sure how well its<br>
followed nowadays but thats the process for it.<br>
_______________________________________________<br>
Please visit <a href="https://lists.isc.org/mailman/listinfo/bind-users" rel="noreferrer noreferrer" target="_blank">https://lists.isc.org/mailman/listinfo/bind-users</a> to unsubscribe from this list<br>
<br>
bind-users mailing list<br>
<a href="mailto:bind-users@lists.isc.org" target="_blank" rel="noreferrer">bind-users@lists.isc.org</a><br>
<a href="https://lists.isc.org/mailman/listinfo/bind-users" rel="noreferrer noreferrer" target="_blank">https://lists.isc.org/mailman/listinfo/bind-users</a><br>
</blockquote></div>