fishScript.com
Web architectures

Using robots.txt files


When a robot visits a web site, the first file it looks for is robots.txt in the root directory:

www.w3c.org/robots.txt

If the file is placed in any other directory of the web site, robots will ignore it.
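
As a minimal sketch of the rule above, the Python function below builds the address a robot would request for any page of a site. The helper name robots_txt_url is hypothetical and chosen only for illustration; it uses the standard urllib.parse module.

```python
from urllib.parse import urlsplit, urlunsplit


def robots_txt_url(page_url):
    """Return the root-level robots.txt address for the site hosting page_url."""
    parts = urlsplit(page_url)
    # Keep only scheme and host; the path is always /robots.txt,
    # no matter how deep the page itself sits in the site.
    return urlunsplit((parts.scheme, parts.netloc, "/robots.txt", "", ""))


print(robots_txt_url("http://www.w3.org/2003/03/Translations/byLanguage"))
```

Whatever directory the page lives in, the result points at the root of the same host.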

With the robots.txt file you can restrict what robots crawl by disallowing access to particular directories or files.

Here is a sample robots.txt file that prevents all robots from visiting the entire site:

User-agent: * # applies to all robots
Disallow: / # disallows crawling of every page
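
One way to check a file like this before publishing it is Python's standard urllib.robotparser module. The sketch below is only illustrative: the helper is_allowed, the robot name "AnyBot", and the host www.example.com are assumptions, not part of the original example.

```python
from urllib.robotparser import RobotFileParser

# The sample file shown above, without the inline comments.
SAMPLE = """\
User-agent: *
Disallow: /
"""


def is_allowed(robots_txt, user_agent, url):
    """Return True if robots_txt lets user_agent fetch url."""
    parser = RobotFileParser()
    parser.parse(robots_txt.splitlines())
    return parser.can_fetch(user_agent, url)


# "AnyBot" and www.example.com are placeholders for illustration.
print(is_allowed(SAMPLE, "AnyBot", "http://www.example.com/index.html"))
```

With this sample the check should come back negative for every robot name and every path on the site.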


Example from http://www.w3.org/robots.txt

There must be a

# exclude some access-controlled areas
User-agent: *
Disallow: /Team
Disallow: /Project
Disallow: /Systems
Disallow: /Web
Disallow: /History
Disallow: /Out-Of-Date
Disallow: /2002/02/mid
Disallow: /mid/
Disallow: /People/all/
Disallow: /2003/03/Translations/byLanguage
Disallow: /2003/03/Translations/byTechnology

*************************************************************
Example from www.google.com/robots.txt

User-agent: *
Disallow: /search
Disallow: /groups
Disallow: /images
Disallow: /catalogs
Disallow: /catalog_list
Disallow: /news
Disallow: /pagead/
Disallow: /relpage/
Disallow: /imgres
Disallow: /keyword/
Disallow: /u/
Disallow: /univ/
Disallow: /cobrand
Disallow: /custom
Disallow: /advanced_group_search
Disallow: /advanced_search
Disallow: /googlesite
Disallow: /preferences
Disallow: /setprefs
Disallow: /swr
Disallow: /url
Disallow: /wml
Disallow: /hws
Disallow: /bsd?
Disallow: /linux?
Disallow: /mac?
Disallow: /microsoft?
Disallow: /unclesam?
Disallow: /answers/search?q=
Disallow: /local
Disallow: /froogle?
Disallow: /froogle_


*************************************************************
Example from http://www.altavista.com/robots.txt

User-agent: *
Disallow: /search
Disallow: /sidebar
Disallow: /advanced
Disallow: /alchemist
Disallow: /customize
Disallow: /go
Disallow: /go2
Disallow: /cgi-bin
Disallow: /g/
Disallow: /web
Disallow: /r
Disallow: /babelfish
Disallow: /urltrurl
Disallow: /translate
Disallow: /image/results
Disallow: /image/samepage
Disallow: /image/res_detail
Disallow: /audio/results
Disallow: /audio/samepage
Disallow: /audio/res_detail
Disallow: /video/results
Disallow: /video/samepage
Disallow: /video/res_detail
Disallow: /news/more


The asterisk (*) value means that the access rules apply to every robot (user agent).

The "Disallow" field specifies which URLs must not be visited.

For example, the directive:

Disallow: /help

blocks both /help.html and /help/index.html,

whereas

Disallow: /help/

blocks /help/index.html but not /help.html.
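
The difference comes from the Disallow value being compared against the beginning of the requested path. The Python sketch below is a deliberately simplified model of that comparison (the function name is_disallowed is hypothetical, and real robots may add further rules such as URL decoding or wildcards):

```python
def is_disallowed(path, disallow_value):
    """Simplified prefix check: True if path starts with the Disallow value."""
    # An empty Disallow value blocks nothing.
    return bool(disallow_value) and path.startswith(disallow_value)


for rule in ("/help", "/help/"):
    for path in ("/help.html", "/help/index.html"):
        state = "blocked" if is_disallowed(path, rule) else "allowed"
        print(f"Disallow: {rule:<7} {path:<17} {state}")
```

The trailing slash in /help/ is what keeps /help.html out of the match, since "/help.html" does not begin with "/help/".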

If "Disallow" is given no value, the robot is told that it may crawl anywhere on the site:
Disallow:
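
As a rough check of this behaviour, the sketch below feeds an empty Disallow to Python's standard urllib.robotparser module. The robot name "AnyBot" and the host www.example.com are placeholders, and the "User-agent: *" line is added here as an assumption so that the rule forms a complete record.

```python
from urllib.robotparser import RobotFileParser

# Assumed complete record: every robot, nothing disallowed.
OPEN_SITE = """\
User-agent: *
Disallow:
"""

parser = RobotFileParser()
parser.parse(OPEN_SITE.splitlines())

# Placeholder robot name and URLs, for illustration only.
for url in ("http://www.example.com/", "http://www.example.com/help/index.html"):
    print(url, parser.can_fetch("AnyBot", url))
```

Compare this with "Disallow: /", where the single slash matches the start of every path and so blocks the whole site.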






signal Marco Magnani marcomagnani@fishscript.com




Last modified: 2017-11-30 amministratore@fishscript.com
Nico and Marco Magnani Software Production
© 2003-2008 www.fishScript.com ®