# robots.txt for http://www.csun.edu/ # permitting most crawling and scraping User-agent: * Disallow: # banning Google's robot from /homepage/* User-agent: googlebot Disallow: /homepage/* # banning Google's robot from /calendar/* User-agent: googlebot Disallow: /calendar/* # banning Google's robot from /account/* User-agent: googlebot Disallow: /account/* # banning Google's robot from /portal/* User-agent: googlebot Disallow: /portal/* # banning Google's robot from /test/* User-agent: googlebot Disallow: /test/* # banning Google's robot from /ua/email/* User-agent: googlebot Disallow: /ua/email/* # banning Google's robot from /search/* User-agent: googlebot Disallow: /search/* # banning Google's robot from /outage/* User-agent: googlebot Disallow: /outage/* # banning Google's robot from /_archives/* User-agent: googlebot Disallow: /_archives/* # banning Google's robot from /ua/development/annualgiving/* User-agent: googlebot Disallow: /ua/development/annualgiving/* # banning Google's robot from /studentaffairs.csun.edu/support/attachments/71/transfer_students_fall_2010__upload.csv* User-agent: googlebot Disallow: /studentaffairs.csun.edu/support/attachments/71/transfer_students_fall_2010__upload.csv* # banning Google's image index robot User-agent: Googlebot-Image Disallow: / # banning Yahoo! Image Search robot User-agent: Yahoo-MMCrawler Disallow: / # banning picsearch robot User-agent: psbot Disallow: / # prevent crawling the NOC subsite User-agent: * Disallow: /~nocweb/* User-agent: * Disallow: /it/noc/* # per client request (4180763) steer searchbots to human-readable URL User-agent: * Disallow: /~uclub99/* User-agent: * Disallow: /~amc1700/* #Ticket 4279905 User-agent: * Disallow: /~hfar010/* #Ticket 4280435 ################################################ # The following are defunct accounts and/or the # formalized URLs should be used in lieu of the # the /~/ convention. # User-agent: * Disallow: /~acrp/ Disallow: /~amc1700/ Disallow: /~bha/ Disallow: /~bhp/ Disallow: /~bus302/ Disallow: /~busadmn/ Disallow: /~buscsbe/ Disallow: /~busdean/ Disallow: /~businter/ Disallow: /~buslab/ Disallow: /~buslab// Disallow: /~buspace/ Disallow: /~bustutor/ Disallow: /~cfcfbr/ Disallow: /~economics/ Disallow: /~ecsdean/ Disallow: /~facacct/ Disallow: /~hfaas002/ Disallow: /~hfaca002/ Disallow: /~hfact003/ Disallow: /~hfanr055/ Disallow: /~hfbio002/ Disallow: /~hfbus018/ Disallow: /~hfbus019/ Disallow: /~hfbus021/ Disallow: /~hfbus023/ Disallow: /~hfbus024/ Disallow: /~hfchm006/ Disallow: /~hfchs006/ Disallow: /~hflin001/ Disallow: /~hflrc006/ Disallow: /~hflrc007/ Disallow: /~hfmbp006/ Disallow: /~hfmth006/ Disallow: /~hfmth009/ Disallow: /~hfoao078/ Disallow: /~hfsnm011/ Disallow: /~humndean/ Disallow: /~magtdept/ Disallow: /~mba/ Disallow: /~mic/ Disallow: /~nsls/ Disallow: /~nursing/ Disallow: /~pb2789/inscenter/ Disallow: /~philos33/ Disallow: /~smaceop/ Disallow: /~smssceop/ Disallow: /~vfoao0lc/ #More entries added,same ticket 4280435 ################################################ # The following are defunct accounts and/or have # the formalized URL shoudl be used. Each of # these also have an entry in the aliases.conf. # Hence, no robot should access the content # using # the /~/ convention. User-agent: * Disallow: Disallow: /~acrp/ Disallow: /~aisp/ Disallow: /~amc1700/ Disallow: /~atc/ Disallow: /~bhp/ Disallow: /~btcuse1/ Disallow: /~bus302/ Disallow: /~busadmn/ Disallow: /~buscier/ Disallow: /~buscqpe/ Disallow: /~busdean/ Disallow: /~busdevel/ Disallow: /~buslab/ Disallow: /~busoit/ Disallow: /~bustutor/ Disallow: /~cenamstu/ Disallow: /~coms/ Disallow: /~cqpe/ Disallow: /~cre1273/ Disallow: /~csbs/ Disallow: /~ecsdean/ Disallow: /~eopcntrl/ Disallow: /~facacct/ Disallow: /~geology/ Disallow: /~gripact/ Disallow: /~hfaas002/ Disallow: /~hfaca002/ Disallow: /~hfact003/ Disallow: /~hfact005/ Disallow: /~hfact008/ Disallow: /~hfanr055/ Disallow: /~hfant005/ Disallow: /~hfart010/ Disallow: /~hfbio002/ Disallow: /~hfbus019/ Disallow: /~hfbus021/ Disallow: /~hfchm006/ Disallow: /~hfchs006/ Disallow: /~hfgeg003/ Disallow: /~hfhis003/ Disallow: /~hflin001/ Disallow: /~hflrc007/ Disallow: /~hfmbp006/ Disallow: /~hfmth006/ Disallow: /~hfmth009/ Disallow: /~hfmus003/ Disallow: /~hfpsy004/ Disallow: /~hfsnm011/ Disallow: /~hfsoc012/ Disallow: /~hhd/ Disallow: /~hhd/ Disallow: /~hhd/ Disallow: /~hhd/ Disallow: /~journalism/ Disallow: /~jsprogr/ Disallow: /~magtdept/ Disallow: /~massocwork/ Disallow: /~mba/ Disallow: /~mc20842/ Disallow: /~meis/ Disallow: /~pc14996/ Disallow: /~philos33/ Disallow: /~physics/ Disallow: /~rs46976/ Disallow: /~sch/ Disallow: /~sfverc/ Disallow: /~slirussian/ Disallow: /~theatre/ Disallow: /~ugs/ # Associated with IT Disallow: /~academtech/ Disallow: /~it/ # Associated with President's office Disallow: /~presofc/ # Associated with Student Affairs Disallow: /~ad46894/ Disallow: /~bha/ Disallow: /~deltaspi/ Disallow: /~mic/ Disallow: /~nso/ Disallow: /~sdip/ Disallow: /~shcenter/ Disallow: /~web_talentsearch/ Disallow: /~studaff/ Disallow: /~dres/ Disallow: /~ncod/ Disallow: /~counseling/ Disallow: /~commence/ Disallow: /~financialaid/ Disallow: /~international/ Disallow: /~career/ Disallow: /~testing/ Disallow: /~hfdss006/ Disallow: /~hfout007/ Disallow: /~mynorthridge/ # Associated with Univ Corp Disallow: /~vfoao0lc/