WORLD WIDE WEB FAQ

_World Wide Web Frequently Asked Questions (With Answers, of Course!)_

Copyright 1994, 1995, Thomas Boutell.

This document is available from many sites, and in several languages. _Please use the site closest to you in the language of your choice._

This FAQ consists of many files. By popular request, it is now available as an MSDOS .ZIP file, as a Unix compressed .tar file, and as a single, large text file. If you have trouble browsing HTML files offline under Windows, please see the relevant FAQ entry. Of course, to get the latest and greatest information, it is best to browse it right here on the web!

Contents

* About this document
* Recent changes to the FAQ
* Introduction to the World Wide Web
* Obtaining and using web browsers
* Establishing and using web servers
* Authoring web pages, images and scripts
* Other resources about the Web
* Credits

Introduction to the World Wide Web

Contents:

* What is the Web?
* What is a URL?
* What are SGML and HTML?
* How does the Web compare to Gopher and WAIS?
* What is the W3 consortium?
* How can I access the Web?
* What is available through the web?
* How do I find out what's new on the Web?
* Where is the subject catalog of the Web?
* How can I search through ALL web sites?
* Can I catch a virus from a web page?
* How can I find out when a web page has changed?
* How do I publish on the Web?
* Who uses the Web?
* What is VRML?
* What is Java?

Obtaining and using web browsers

Contents:

* Browsers accessible by telnet
* Obtaining Amiga browsers
* Obtaining Macintosh browsers
* Obtaining MS-DOS (non-Windows) browsers
* Obtaining NeXT browsers
* Obtaining Unix and VMS browsers
* Obtaining VM/CMS browsers
* Obtaining Microsoft Windows and OS/2 browsers
* Obtaining X Window System / DecWindows browsers
* Obtaining Acorn RISCOS browsers
* Obtaining batch-mode "browsers"
* I can't get SLIP or PPP. I want web access. Is there a way?
* Can I browse HTML files locally when I'm offline?
* How can I access the Web through a firewall?
* I'm running XMosaic. Why don't my external viewers work?
* I have a Windows PC or a Mac. Why can't I access WAIS URLs?
* How do I print legible ASCII versions of HTML documents?
* How can I save an inline image to disk?
* How can I send newsgroup posts in HTML to my web browser?
* How can I get sound from the PC speaker with WinMosaic?

Establishing and using web servers

Contents:

* Amiga servers
* Macintosh servers
* MS-DOS and Novell Netware servers
* Unix servers
* VM/CMS servers
* VMS servers
* Microsoft Windows, IBM OS/2 and MS Windows NT Servers
* Can I serve two domains from one server?
* Comparison: which server is best?
* How fast does my connection have to be?
* Do I have to approve every imagemap my users create?
* Can I safely allow my users to run their own CGI scripts?
* Can I lease space on an existing server?
* How can I keep robots off my server?
* How do I publicize my server?
* How can I secure access to my server?
* How can I keep statistics on my server?
* How can I serve [Word documents, Excel spreadsheets...] through my server?

Authoring web pages, images and scripts

Contents:

* Overview: how to create web documents
* Writing HTML documents yourself
* HTML editors
* Converting other formats to HTML
* Checking web pages for errors
* How can I "include" one HTML document in another?
* How can I give my web page a tiled or colored background?
* Generating web pages from a program or database (CGI)
* How can I identify the user accessing my CGI script?
* My CGI script doesn't work! What's wrong?
* How can I keep my document from being cached?
* How can users send me comments and/or email?
* How can I create fill-out forms?
* Are HTML 3.0 tables ready? Are there other options?
* How can I use inline images without alienating my users?
* How can I distribute audio through the web?
* How can I generate inline images on the fly?
* How can I create hidden fields in forms?
* What is HTML 3.0?
* How do I comment an HTML document?
* How do I create clickable image maps?
* How can I create transparent and interlaced GIFs? What are they?
* Which is better for the web, JPEG or GIF?
* Can I lease space on an existing server?
* Can I make a link that doesn't load a new page?
* How can I mirror part of another server?
* Does mailto: work in all browsers?
* How can I serve [Word documents, Excel spreadsheets...] through my server?
* How do I publicize my work?
* Hey, why can't I write a web-exploring robot?
* Where can I get an access counter for my page?

Other resources about the Web

Contents:

* Books about the Web
* Mailing lists about the Web
* Newsgroups about the Web

Credits

ABOUT THE WORLD WIDE WEB FAQ

The World Wide Web Frequently Asked Questions (FAQ) is intended to answer the most common questions about the web. The FAQ is maintained by Thomas Boutell. Copyright 1994, 1995 by Thomas Boutell.

The complete FAQ is available from several sites. If you can, you will want to access it through the web.
Use the site closest to you in the language you prefer (non-English sites are marked):

* Sunsite, eastern United States (North America):
* Oxford University, UK (Europe):
* Poznan University of Technology, Poznan, Poland (Europe, in Polish):
* Poznan University of Technology, Poznan, Poland (Europe, in English):
* New Software Technologies Service, Austria (Europe):
* Astronomical Observatory of Padova, Italy (Europe):
* Glocom, Japan (Asia):
* The University of Melbourne (Australia/Pacific):
* Telstra Corporation, Australia (Australia/Pacific):
* Internex Online, Toronto, Canada (North America):
* Communications Vir, Montreal, Canada (North America):
* Community Access Canada, University of New Brunswick, Canada (North America):
* Island Internet, British Columbia, Canada (North America):
* Acer Inc., Taipei, Taiwan (Asia, in Chinese):
* Fraunhofer Institute for Computer Graphics, Darmstadt, Germany:

_________________________________________________________________

_World Wide Web FAQ_

RECENT CHANGES TO THE FAQ

_Please accept my apologies for the recent lengthy pause between updates._ Getting married tends to take up all of one's attention for a while. I am also quite far along in the manuscript of "CGI Programming in C and Perl," to be published by Addison-Wesley.

Note that all of the constituent files of the FAQ have changed their names. This was done by popular request to make a single MSDOS .ZIP file version of the FAQ possible. My apologies for any inconvenience.

9/24:

* All filenames shortened to make a DOS .zip file version possible.
* DOS .zip file version made available.
* Single .txt file version made available.
* Server-side includes section added.
* Acorn Archimedes section added.
* Mailing lists section now points to the W3 org's information
* Imagemapping software section expanded
* HTML editor section overhauled
* Permissions problems discussed in CGI problems section
* Background tiles and colors section added
* PhotoGIF, an Adobe Photoshop add-in for sophisticated GIF support
* How to stop browsers from caching dynamic pages
* Spinner, a nonforking Unix web server
* FolkWeb, a threaded Windows 95 and NT server
* Snowhare's analysis tools and the analog analysis tool added to statistics section
* Books, books, books.
* Section on running Windows browsers while offline added
* Many miscellaneous fixes and additions not listed here

_________________________________________________________________

_World Wide Web FAQ_

CREDITS

Maintainer (11/93 to present): Thomas Boutell, _boutell@netcom.com_

Former Maintainer (until 11/93): Nathan Torkington, _Nathan.Torkington@vuw.ac.nz_

_________________________________________________________________

_World Wide Web FAQ_

WHAT ARE WWW, HYPERTEXT AND HYPERMEDIA?

WWW stands for "World Wide Web". The WWW project, started by CERN (the European Laboratory for Particle Physics), seeks to build a distributed hypermedia system.

The advantage of hypertext is that in a hypertext document, if you want more information about a particular subject mentioned, you can usually "just click on it" to read further detail. In fact, documents can be and often are linked to other documents by completely different authors -- much like footnoting, but you can get the referenced document instantly!

To access the web, you run a browser program. The browser reads documents, and can fetch documents from other sources. Information providers set up hypermedia servers which browsers can get documents from. The browsers can, in addition, access files by FTP, NNTP (the Internet news protocol), gopher and an ever-increasing range of other methods.
On top of these, if the server has search capabilities, the browsers will permit searches of documents and databases.

The documents that the browsers display are hypertext documents. Hypertext is text with pointers to other text. The browsers let you deal with the pointers in a transparent way -- select the pointer, and you are presented with the text that is pointed to.

Hypermedia is a superset of hypertext -- it is any medium with pointers to other media. This means that browsers might not display a text file, but might display images or sound or animations.

_________________________________________________________________

_World Wide Web FAQ_

WHAT IS A URL?

URL stands for "Uniform Resource Locator". It is a draft standard for specifying an object on the Internet, such as a file or newsgroup.

URLs look like this (file: and ftp: URLs are synonymous):

* file://wuarchive.wustl.edu/mirrors/msdos/graphics/gifkit.zip
* ftp://wuarchive.wustl.edu/mirrors
* http://www.w3.org:80/default.html
* news:alt.hypertext
* telnet://dra.com

The first part of the URL, before the colon, specifies the access method. The part of the URL after the colon is interpreted according to the access method. In general, two slashes after the colon indicate a machine name (machine:port is also valid).

When you are told to "check out this URL", what to do next depends on your browser; please check the help for your particular browser. For the line-mode browser at CERN, which you will quite possibly use first via telnet, the command to try a URL is "GO URL" (substitute the actual URL of course). In Lynx you just select the "GO" link on the first page you see; in graphical browsers, there's usually an "Open URL" option in the menus.

_________________________________________________________________

_World Wide Web FAQ_

WHAT ARE SGML AND HTML?

Documents on the World Wide Web are written in a simple "markup language" called HTML, which stands for Hypertext Markup Language.
SGML is a much broader language which is used to define particular markup languages for particular purposes. HTML is just a specific application of SGML. You can learn more about SGML, and the rationale behind HTML, by reading A Gentle Introduction to SGML (URL is ), a document provided by the Text Encoding Initiative. _________________________________________________________________ _World Wide Web FAQ_ HOW DOES WWW COMPARE TO GOPHER AND WAIS? While all three of these information presentation systems are client-server based, they differ in terms of their model of data. In gopher, data is either a menu, a document, an index or a telnet connection. In WAIS, everything is an index and everything that is returned from the index is a document. In WWW, everything is a (possibly) hypertext document which may be searchable. In practice, this means that WWW can represent the gopher (a menu is a list of links, a gopher document is a hypertext document without links, searches are the same, telnet sessions are the same) and WAIS (a WAIS index is a searchable page, returning a document with no links) data models as well as providing extra functionality. World Wide Web usage grew far beyond Gopher usage in the last few months, according to the statistics-keepers of the Internet backbone. (Of course, World Wide Web browsers can also access Gopher servers, which inflates the numbers for the latter.) WWW has long since reached critical mass, with new commercial and noncommercial sites appearing daily. _________________________________________________________________ _World Wide Web FAQ_ WHAT IS THE W3 CONSORTIUM? The W3 consortium is an industry consortium headed by the Laboratory for Computer Science at the Massachusetts Institute of Technology. The W3 consortium seeks to promote standards and encourage interoperability between WWW products. See for more information. 
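
As a rough illustration of the URL anatomy described in the "What is a URL?" entry above, here is a minimal sketch in Python (a language the FAQ itself does not use). It relies on the standard library's urllib.parse module; the helper name describe and the labels in its result are made up for this example. It simply splits a URL into the access method before the colon, the machine and optional port after the two slashes, and whatever remains.

```python
from urllib.parse import urlsplit


def describe(url):
    """Split a URL into the pieces named in the "What is a URL?" entry.

    The function name and dictionary keys are illustrative only.
    """
    parts = urlsplit(url)
    return {
        "method": parts.scheme,     # text before the first colon
        "machine": parts.hostname,  # None when there is no //machine part
        "port": parts.port,         # None unless machine:port was given
        "rest": parts.path,         # interpreted according to the method
    }


# The sample URLs listed in the FAQ entry.
for url in [
    "file://wuarchive.wustl.edu/mirrors/msdos/graphics/gifkit.zip",
    "ftp://wuarchive.wustl.edu/mirrors",
    "http://www.w3.org:80/default.html",
    "news:alt.hypertext",
    "telnet://dra.com",
]:
    print(url, "->", describe(url))
```

Note that news:alt.hypertext has no double slash, so no machine is reported for it; the newsgroup name ends up in the remaining part instead.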
_________________________________________________________________ _World Wide Web FAQ_ INTRODUCTION: HOW CAN I ACCESS THE WEB? You have two basic options: use a browser on your own machine (the best option) or use a browser that can be telnetted to (not nearly as good, but possible). Web access by email is not available at this time. Note, however, that the traditional online services such as AOL, Prodigy, and Compuserve now offer web access of one degree or another as a standard feature. It is always best to run a browser on your own machine, unless you absolutely cannot do so; but feel free to telnet to a browser for your first look at the web, or use email if the telnet command does not work on your system (_try it first!_). Note that "your machine" can be defined as a system you dial into from home, such as netcom or another account provider. Running a text-based browser on such a system is still preferable to telnetting to a faraway site. Access to the web by email has been possible at various points in time, but the volume of incoming requests simply cannot be handled by any one central site. Obtaining a better grade of Internet access that allows you to run a web browser is strongly encouraged. There is one low-tech solution: web by FAX! Consider the following information, submitted by Bill Stearns: If you have access to a fax machine, do the following: 1) Call 805-730-7777 from your fax machine. 2) Select number 2 (I have the document ID already) 3) Type in the document ID; for the above page, it's 17571, then press # (the pound symbol) 4) Press pound at the next prompt if you're calling from your fax machine, or enter the phone number of your fax machine and then press pound. 5) Wait for that page to come over, and then repeat the process with the 5 or 6 digit number in brackets next to the link you'd like to follow. 
A few other useful pages:

17581 800 number (toll-free) service providers
17582 The list of area codes - a good place to start as well if you're in the U.S.

By the way, this free service is provided by Universal Access (http://www.ua.com/, document number 16968) and is not limited to just this directory. If you know the name of the machine hosting the web page you want to view, you can probably reach it through this service. You simply type in the name of the machine (www.teleport.com, for example) at menu option 3. When you've received the home page for the site, keep following the trail to the page you'd like. It takes a while and some long distance calls, but the service is otherwise free.

My sincere thanks both to Universal Access and the Celestin company for providing these services.

_________________________________________________________________

_World Wide Web FAQ_

WHAT IS ON THE WEB?

Currently accessible through the web:

* anything served through gopher
* anything served through WAIS
* anything on an FTP site
* anything on Usenet
* anything accessible through telnet
* anything in hytelnet
* anything in hyper-g
* anything in techinfo
* anything in texinfo
* anything in the form of man pages
* sundry hypertext documents

_________________________________________________________________

_World Wide Web FAQ_

HOW DO I FIND OUT WHAT'S NEW ON THE WEB?

comp.infosystems.www.announce
    The newsgroup comp.infosystems.www.announce carries announcements of new resources on the World Wide Web. Since newsgroups are distributed, it can be accessed reliably even when the net is very busy.

What's New With NCSA Mosaic
    The unofficial newspaper of the World Wide Web is What's New With NCSA Mosaic (URL is http://www.ncsa.uiuc.edu/SDG/Software/Mosaic/Docs/whats-new.html ), which carries announcements of new servers on the web and also of new web-related tools. This should be in your hot list if you're not using Mosaic (which can access it directly through the help menu).
comp.internet.net-happenings
    You can also check out the newsgroup comp.internet.net-happenings, which carries WWW announcements and many other Internet-related announcements.

_________________________________________________________________

_World Wide Web FAQ_

WHERE IS THE SUBJECT CATALOG OF THE WEB?

There are several. There is no mechanism inherent in the web which forces the creation of a single catalog (although there is work underway on automatic mechanisms to catalog web sites).

The best-known catalog, and the first, is The WWW Virtual Library (URL is http://www.w3.org/hypertext/DataSources/bySubject/Overview.html ), maintained by CERN. The Virtual Library is a good place to find resources on a particular subject, and has separate maintainers for many subject areas.

Yahoo (URL is ) is probably the most complete hierarchical, topical index of web sites, and also features a sophisticated search facility.

_________________________________________________________________

_World Wide Web FAQ_

HOW CAN I SEARCH THROUGH ALL WEB SITES?

Several people have written robots which create indexes of web sites -- including sites which have not arranged to be mentioned in the newspapers and catalogs above. (Before writing your own robot, please read the entry in the authoring section regarding robots.) Here are a few such automatic indexes you can search:

Yahoo (URL is ) is probably the most complete hierarchical, topical index of web sites, and also features a sophisticated search facility.

Lycos (URL is http://fuzine.mt.cs.cmu.edu/mlm/lycos-home.html ) is another web-indexing robot, which includes the ability to submit the URLs of your own documents by hand, ensuring that they are available for searching.

WebCrawler (URL is ) builds an impressively complete index; on the other hand, since it indexes the content of documents, it may find many links that aren't exactly what you had in mind.
However, it does a good job of sorting the documents it finds according to how closely they match your search. World Wide Web Worm (URL is http://www.cs.colorado.edu/home/mcbryan/WWWW.html ) builds its index based on page titles and URL contents only. This is somewhat less inclusive, but pages it finds are more likely to be an exact match with your needs. InfoSeek is a commercial search service which also offers a free web search facility . You can specify phrases to locate, among other query operations, and InfoSeek's commercial service can search more than just web pages (newsgroups, for instance). InfoSeek's commercial service charges 10 cents per query and offers a free trial to new users. (Increasing load on the free search servers makes this sound better every day.) OpenText (URL is ) also offers a robust web searching facility. You can read about other search robots and the principles behind them in the robots section. _________________________________________________________________ _World Wide Web FAQ_ CAN I CATCH A VIRUS BY LOOKING AT A WEB PAGE? _No._ Your computer can, of course, catch a virus if you download an executable program from an untrustworthy site and then, of your own free will, double-click on it in your file manager (or Mac desktop, or...). This is the same risk you run when downloading programs from bulletin board systems or via anonymous FTP. Viewing images, filling out forms and so on is harmless. So, most likely, is downloading a program from a respectable source with a reputation to protect. _________________________________________________________________ _World Wide Web FAQ_ HOW CAN I FIND OUT IF A WEB PAGE HAS BEEN UPDATED? Most of the time, web servers deliver information only when you ask for it. Usually this is a good thing, but in some cases you may want to be notified when a web page has changed. 
When you want notification that a page has changed, consider using URL-minder (URL is ), a web-browsing robot which will automatically notify you by email when a page of interest to you has been updated. _________________________________________________________________ _World Wide Web FAQ_ HOW CAN I PROVIDE INFORMATION TO THE WEB? Information providers run programs that the browsers can obtain hypertext from. These programs can either be WWW servers that understand the HyperText Transfer Protocol HTTP (best if you are creating your information database from scratch), "gateway" programs that convert an existing information format to hypertext, or a non-HTTP server that WWW browsers can access -- anonymous FTP or gopher, for example. To learn more about World Wide Web servers, see the server section. You can also consult a www server primer by Nathan Torkington, available at the URL http://www.vuw.ac.nz/who/Nathan.Torkington/ideas/www-servers.html . If you only want to provide information to local users, placing your information in local files is also an option. This means, however, that there can be no off-machine access. _________________________________________________________________ _World Wide Web FAQ_ WHO USES THE WEB? Good question! The web is certainly biased toward the thirtyish, anglo-saxon, male and technology-friendly crowd at this point, but there's more to the story; the demographics of the web are changing rapidly as the user base grows. The GVU WWW User Survey (URL is ) attempts to answer the question in detail. You can access the results of past surveys and contribute information of your own. _________________________________________________________________ _World Wide Web FAQ_ WHAT IS VRML? VRML, the Virtual Reality Modeling Language, is an attempt to extend the web into the domain of three-dimensional graphics. 
VRML "worlds" can depict realistic or otherworldly places, which can contain objects that link to other documents or VRML worlds on the web.

For more information about VRML, including where to find browsers and other VRML tools for your system, consult the VRML Home Page at Wired (URL is ) for general technical information about the effort, and the WebSpace home page at SGI (URL is ) for the first VRML viewer to become available.

_________________________________________________________________

_World Wide Web FAQ_

WHAT IS JAVA?

Java is a language developed by Sun Microsystems which allows World Wide Web pages to contain code that is executed on the browser. Because Java is based on a single "virtual machine" that all implementations of Java emulate, it is possible for Java programs to run on any system which has a version of Java. It is also possible for the "virtual machine" emulator to make sure that Java programs downloaded through the web do not attempt to do unauthorized things.

Actually, Java can be used in the absence of the web, but the application that has sparked so much interest in Java is HotJava, a web browser written in the Java language.

You can learn more about Java and HotJava from Sun's HotJava home page (URL is ).

_________________________________________________________________

_World Wide Web FAQ_

BROWSERS ACCESSIBLE BY TELNET

An up-to-date list of these is available on the Web as http://www.w3.org/hypertext/WWW/FAQ/Bootstrap.html and should be regarded as an authoritative list.

telnet.w3.org
    A telnettable browser provided by the W3 consortium.

www.cc.ukans.edu
    Offers Lynx, a full screen browser which requires a vt100 terminal. Log in as www. Does not allow users to "go" to arbitrary URLs, so GET YOUR OWN COPY of Lynx and install it on your system if your administrator has not done so already. Lynx is the best plain-text browser, so move mountains if necessary to get your own copy of Lynx!

www.njit.edu (or telnet 128.235.163.2)
    Log in as www.
    A full-screen browser at the New Jersey Institute of Technology, USA.

www.huji.ac.il
    A dual-language Hebrew/English database, with links to the rest of the world. The line mode browser, plus extra features. Log in as www. Hebrew University of Jerusalem, Israel.

info.funet.fi (or telnet 128.214.6.102)
    Log in as www. Offers several browsers, including Lynx.

fserv.kfki.hu
    Hungary. Has a slow link; use from nearby. Log in as www.

_________________________________________________________________

_World Wide Web FAQ_

AMIGA BROWSERS

AMosaic
    Browser for AmigaOS, based on NCSA's Mosaic. Supports older Amigas as well as the newer machines in the latest versions; available for anonymous ftp from max.physics.sunysb.edu in the directory /pub/amosaic, or from aminet sites in /pub/aminet/comm/net. See the site for details. See . See also the FAQ available at .

Amiga Lynx
    An Amiga version of the Lynx text-based browser. Supports forms, while AMosaic does not. See .

Emacs w3-mode
    A WWW browser for emacs. Runs under Gnu Emacs on the Amiga. Has fonts, color, inline images, and mouse support if using Lemacs, Epoch, or Emacs 19. Available by anonymous ftp from ftp.cs.indiana.edu in the directory pub/elisp/w3.

_________________________________________________________________

_World Wide Web FAQ_

MACINTOSH BROWSERS

NOTE: These browsers require that you have SLIP, PPP or other TCP/IP networking on your PC. SLIP or PPP can be accomplished over phone lines. You can do this one of two ways: using a proper SLIP account, which requires the active cooperation of your network provider or educational institution (see Frank Hecker's guide to SLIP and PPP access; URL is ; ), or using The Internet Adapter or SLiRP, products which simulate SLIP through your dialup Unix shell account.

If you only have non-Unix based dialup shell access, or have no PC at home, your best option at this time is to run Lynx on the VMS (or Unix, or...) system you call, or telnet to a browser if you cannot do so.
NCSA Mosaic for Macintosh From NCSA. Full featured. Available by anonymous FTP from ftp.ncsa.uiuc.edu in the directory Mac/Mosaic. Netscape From Netscape Communications Corp (URL is ). Downloads and displays images incrementally while you read pages, which also display incrementally. Also supports tables in a standard manner, in addition to many extensions to HTML, not all of which conform to the proposed standard. Netscape is a commercial product but can be evaluated free of charge for 90 days by individuals. Available by anonymous FTP from ftp.netscape.com in the netscape subdirectory. See Netscape's web site for information about mirror sites. MacWeb From EINet. Has features that Mosaic lacks; lacks some features that Mosaic has. Available by anonymous FTP from ftp.einet.net in the directory einet/mac/macweb. Emacs w3-mode A WWW browser for emacs. Runs under Xwindows, NeXTstep, VMS, OS/2, Windows NT, Windows 3.1, AmigaDOS, or just about any Unix system. Also has fonts, color, inline images, and mouse support if using Lemacs, Epoch, or Emacs 19. Also works in local mode under DOS and on the Macintosh. Available by anonymous ftp from ftp.cs.indiana.edu in the directory pub/elisp/w3. Enhanced Mosaic Enhanced Mosaic, from Spyglass, Incorporated, is the commercial version of NCSA Mosaic. Spyglass does not offer the browser directly to the public; instead, they license it to various OEMs. You can learn more about their licensing arrangements and the existing licensees from the Spyglass home page (URL is ). _________________________________________________________________ _World Wide Web FAQ_ MSDOS BROWSERS NOTE: These browsers require that you have SLIP, PPP or other TCP/IP networking on your PC. SLIP or PPP can be accomplished over phone lines. 
You can do this one of two ways: using a proper SLIP account, which requires the active cooperation of your network provider or educational institution, or using The Internet Adapter or SLiRP, products which simulate SLIP through your dialup Unix shell account. If you only have non-Unix based dialup shell access, or have no PC at home, your best option at this time is to run Lynx on the VMS (or Unix, or...) system you call, or telnet to a browser if you cannot do so. DosLynx DosLynx is an excellent text-based browser for use on DOS systems. You must have a level 1 packet driver, or an emulation thereof, or you will only be able to browse local files; essentially, if your PC has an Ethernet connection, or you have SLIP, you should be able to use it. DosLynx can view GIF images, but not when they are inline images (as of this writing). See the README.HTM file at the DosLynx site for details. You can obtain DosLynx by anonymous FTP from ftp2.cc.ukans.edu in the directory pub/WWW/DosLynx; the URL is ftp://ftp2.cc.ukans.edu/pub/WWW/DosLynx/. Minuet An all-in-one Internet access package for MSDOS. Includes both text-mode and graphics-mode display. Available by anonymous FTP from minuet.micro.umn.edu in the directory pub/minuet/latest/minuarc.exe. _________________________________________________________________ _World Wide Web FAQ_ NEXTSTEP BROWSERS Note: NeXTStep systems can also run X-based browsers using one of the widely used X server products for the NeXT. The browsers listed here, by contrast, are native NeXTStep applications. SpiderWoman A multithreaded, graphical browser for NeXTStep. Available by anonymous FTP from sente.epfl.ch in the directory pub/software (URL is ). Netsurfer Another true NeXTStep browser. Available by anonymous FTP from ftp.thoughtport.com in the directory /pub/next/netsurfer (URL is ). OmniWeb A World Wide Web browser for NeXTStep. 
The URL for more information is http://www.omnigroup.com/; you can ftp the package from ftp.omnigroup.com in the /pub/software/ directory. WorldWideWeb, CERN's NeXT Browser-Editor A browser/editor for NeXTStep. _Currently out of date; editor not operational._ Allows wysiwyg hypertext editing. Requires NeXTStep 3.0. Available for anonymous FTP from ftp.w3.org in the directory /pub/www/src. Emacs w3-mode A WWW browser for emacs. Runs under Xwindows, NeXTstep, VMS, OS/2, Windows NT, Windows 3.1, AmigaDOS, or just about any Unix system. Also has fonts, color, inline images, and mouse support if using Lemacs, Epoch, or Emacs 19. Also works in local mode under DOS and on the Macintosh. Available by anonymous ftp from ftp.cs.indiana.edu in the directory pub/elisp/w3. _________________________________________________________________ _World Wide Web FAQ_ TEXT-MODE UNIX AND VMS BROWSERS These are text-based browsers for Unix (and in some cases also VMS) systems. In many cases your system administrator will have already installed one or more of these packages; check before compiling your own copy. Line Mode Browser This program gives W3 readership to anyone with a dumb terminal. A general purpose information retrieval tool. Available by anonymous ftp from www.w3.org in the directory /pub/www/src. The "Lynx" full screen browser This is a hypertext browser for vt100s using full screen, arrow keys, highlighting, etc. Available by anonymous FTP from ftp2.cc.ukans.edu. Tom Fine's perlWWW A tty-based browser written in perl. Available by anonymous FTP from archive.cis.ohio-state.edu in the directory pub/w3browser as the file w3browser-0.1.shar. For VMS Dudu Rashty's full screen client based on VMS's SMG screen management routines. Available by anonymous FTP from vms.huji.ac.il in the directory www/www_client. Emacs w3-mode A WWW browser for emacs. Runs under Xwindows, NeXTstep, VMS, OS/2, Windows NT, Windows 3.1, AmigaDOS, or just about any Unix system. 
Also has fonts, color, inline images, and mouse support if using Lemacs, Epoch, or Emacs 19. Also works in local mode under DOS and on the Macintosh. Available by anonymous ftp from ftp.cs.indiana.edu in the directory pub/elisp/w3. _________________________________________________________________ _World Wide Web FAQ_ VM/CMS BROWSERS Albert A WWW browser for the VM/CMS operating system. Available by anonymous FTP from ftp.nerdc.ufl.edu in the directory pub/vm/www/. Charlotte A full-screen VM/CMS browser written in REXX, Pipelines and REXX Sockets which runs without changes on any version of CMS from 5 to 11. (URL is ). _________________________________________________________________ _World Wide Web FAQ_ MICROSOFT WINDOWS BROWSERS NOTE: Most of these browsers require that you have SLIP, PPP or other TCP/IP networking on your PC. The exceptions are SlipKnot and I-COMM, which have slightly more limited features but operate without a proper Internet connection. SLIP or PPP can be accomplished over phone lines. You can do this one of two ways: using a proper SLIP account, which requires the active cooperation of your network provider or educational institution (see Frank Hecker's guide to SLIP and PPP access; URL is ), or by using The Internet Adapter or SLiRP, products which simulate SLIP through your dialup Unix shell account. Another product, TwinSock at , provides equivalent functionality under Windows using its own proxy protocol. If you only have non-Unix based dialup shell access, or have no PC at home, your best option at this time is to run Lynx on the VMS (or Unix, or...) system you call, or telnet to a browser if you cannot do so. Cello Browser from Cornell LII. Available by anonymous FTP from ftp.law.cornell.edu in the directory /pub/LII/cello. Mosaic for Windows From NCSA. Available by anonymous FTP from ftp.ncsa.uiuc.edu in the directory PC/Windows/Mosaic, or learn more about it on the web: WinWeb From EINet. 
Available by anonymous FTP from ftp.einet.net in the directory /einet/pc/winweb as the file winweb.zip. Netscape From Netscape Communications Corp (URL is: ). Downloads and displays images incrementally while you read pages, which also display incrementally, making it the best browser at the time of this writing for those who connect to the web via modems. Also supports tables in a standard manner, in addition to many extensions to HTML, not all of which conform to the proposed standard. Netscape is a commercial product but can be evaluated free of charge for 90 days by individuals. The 16-bit version works under both OS/2 and Windows. Available by anonymous FTP from ftp.netscape.com in the netscape subdirectory. See Netscape's web site for information about mirror sites. Quarterdeck Mosaic From Quarterdeck. Supports incremental image loading; available for beta test (URL is ). Compuserve Mosaic From Compuserve (Spry is now part of Compuserve). Works under Windows and OS/2. Supports the mailto: URL, transparent GIFs, ALT tags, hierarchical hotlists, progressive image rendering, and so forth. Internetworks From Internetworks, formerly (?) Booklink. Available by anonymous FTP from ftp.booklink.com in the directory lite; this is a demonstration version of the full browser, which costs $99. Booklink can open many simultaneous connections in different windows and display images and pages progressively; at the time of this writing it is the only browser to equal Netscape in this area. The "lite" version can only open two simultaneous connections, however. SlipKnot SlipKnot (like I-COMM) is a graphical WWW browser that operates entirely without SLIP, PPP, an Ethernet connection, or special server-side software (but read the SLIP emulator section for another workaround). 
Like I-COMM, SlipKnot supports multiple fonts, inline images, forms, and review of documents you have already received while new documents arrive, and it operates entirely through your regular Unix shell account. SlipKnot does _not_ require that you install any new software on your Unix shell account. You can obtain SlipKnot by anonymous FTP from oak.oakland.edu in the directory SimTel/win3/internet. For more information, see the SlipKnot information page (URL is http://www.interport.net/slipknot/slipknot.html ) or send a blank email message to slipknot@micromind.com. I-COMM I-COMM, like SlipKnot, operates without a true TCP/IP connection. It requires a Unix shell account, like SlipKnot, or a VMS shell account, a feature unique to I-COMM. I-COMM also features Zmodem file transfers in both directions and complete support for forms. I-COMM is available for evaluation as shareware (URL is ). IBM OS/2 WebExplorer A native IBM OS/2 web browser. WebExplorer is a multithreaded application and, in addition to the usual "back" and "forward" buttons, features a visual map of your exploration of the web. The software supports progressive image rendering. IBM WebExplorer can be acquired by anonymous FTP from ftp01.ny.us.ibm.net in the directory pub/WebExplorer/ . WebSurfer Included with the Chameleon TCP/IP software package from Netmanage, Inc. Reputedly functional and straightforward. Emacs w3-mode A WWW browser for emacs. Runs under Xwindows, NeXTstep, VMS, OS/2, Windows NT, Windows 3.1, AmigaDOS, or just about any Unix system. Also has fonts, color, inline images, and mouse support if using Lemacs, Epoch, or Emacs 19. Also works in local mode under DOS and on the Macintosh. Available by anonymous ftp from ftp.cs.indiana.edu in the directory pub/elisp/w3 . Enhanced Mosaic Enhanced Mosaic, from Spyglass, Incorporated, is the commercial version of NCSA Mosaic. Spyglass does not offer the browser directly to the public; instead, they license it to various OEMs. 
You can learn more about their licensing arrangements and the existing licensees from the Spyglass home page (URL is ). UdiWWW UdiWWW, unlike all other Windows browsers as of this writing, supports all of the proposed HTML 3.0 standard (except for and ) and also supports Netscape's various nonstandard extensions. UdiWWW is still being tested, but you can obtain it for yourself and see (URL is ). Emissary Emissary, from Wollongong, is both a web browser and a concerted effort to integrate the Internet into the Windows environment (see ). For instance, FTP sites appear much like drives in the file manager, mail can be sent via drag and drop, and WYSIWYG HTML editing is available. Emissary supports several Netscape extensions, but lacks support for tables. _________________________________________________________________ _World Wide Web FAQ_ X/DECWINDOWS (GRAPHICAL UNIX, VMS) BROWSERS NCSA Mosaic for X Unix browser using X11/Motif. The original multimedia browser. Full http 1.0 support including PUT-method forms, image maps, etc. Recent beta versions have limited support for tables. Available by anonymous FTP from ftp.ncsa.uiuc.edu in the directory Mosaic. NCSA Mosaic for VMS Browser using X11/DecWindows/Motif. For the VMS operating system. Full http 1.0 support including PUT-method forms, image maps, etc. Probably the best browser available for VMS. Available by anonymous FTP from ftp.digital.com in the directory pub/DEC/Mosaic. Netscape From Netscape Communications Corp (URL is: ). Downloads and displays images incrementally while you read pages, which also display incrementally. Also supports tables in a standard manner, in addition to many extensions to HTML, not all of which conform to the proposed standard. Netscape is a commercial product but can be evaluated free of charge for an unlimited period of time by individuals. The 16-bit version works under both OS/2 and Windows. Available by anonymous FTP from ftp.netscape.com in the netscape subdirectory. 
See Netscape's web site for information about mirror sites. Quadralay GWHIS Viewer (Commercial Mosaic) Quadralay offers a commercial-grade (not free!) version of Mosaic for Unix systems, with Windows and Macintosh versions expected in the future. (URL is: http://www.quadralay.com/products/products.html#gwhis) tkWWW Browser/Editor for X11 A Unix Browser/Editor for X11 (URL is ). Supports WYSIWYG HTML editing. MidasWWW Browser A Unix/X browser from Tony Johnson. (Beta, works well.) Viola for X (Beta) Viola has two versions for Unix/X: one using Motif, one using Xlib (no Motif). Handles HTML Level 3 forms and tables. Has extensions for multiple columns, collapsible/expandable lists, and client-side document includes. Available by anonymous FTP from ora.com in /pub/www/viola. More information is available at the URL http://xcf.berkeley.edu/ht/projects/viola/README. Chimera Unix/X Browser using Athena (doesn't require Motif). Supports forms, inline images, etc.; closest to Mosaic in feel of the non-Motif X11 browsers. Available for anonymous FTP from ftp.cs.unlv.edu in the directory /pub/chimera. Emacs w3-mode A WWW browser for emacs. Runs under Xwindows, NeXTstep, VMS, OS/2, Windows NT, Windows 3.1, AmigaDOS, or just about any Unix system. Also has fonts, color, inline images, and mouse support if using Lemacs, Epoch, or Emacs 19. Also works in local mode under DOS and on the Macintosh. Available by anonymous ftp from ftp.cs.indiana.edu in the directory pub/elisp/w3. Arena Arena's primary purpose is to be a testbed for HTML Level 3 documents. As a result, Arena supports many of the new and interesting features of HTML Level 3. As of this writing it is still in prerelease and expectations should be set accordingly! Available by anonymous FTP from ftp.w3.org in the directory pub/www/arena/ . Enhanced Mosaic Enhanced Mosaic, from Spyglass, Incorporated, is the commercial version of NCSA Mosaic.
Spyglass does not offer the browser directly to the public; instead, they license it to various OEMs. You can learn more about their licensing arrangements and the existing licensees from the Spyglass home page (URL is ). _________________________________________________________________ _World Wide Web FAQ_ WHAT BROWSERS ARE AVAILABLE FOR THE ACORN RISCOS SYSTEM? arcweb ArcWeb is a World-Wide Web browser for Acorn RISC OS computers, with RISC OS 3.1 or later. webster Another browser, about which I have no further information. Both browsers can be obtained by anonymous FTP from micros.hensa.ac.uk in the directory micros/arch/riscos. _________________________________________________________________ _World Wide Web FAQ_ BATCH-MODE "BROWSERS" The following browsers retrieve the contents of the URL specified on the command line and are intended primarily for use in scripts. Note that most of the text-based Unix browsers can also do this. Batch mode browser A batch-mode "browser", url_get, which is available through the URL http://www.utexas.edu/~zippy/url_get.html . It can be retrieved via anonymous FTP to ftp.cc.utexas.edu, as the file /pub/zippy/url_get.tar.Z. This package is intended for use in cron jobs and other settings in which fetching a page in a command-line fashion is useful. Batch mode browser in tclX A batch mode "browser" (URL retriever) written in extended Tcl (tclX) is available as well (URL is ). _________________________________________________________________ _World Wide Web FAQ_ I CAN'T GET SLIP OR PPP. I WANT WEB ACCESS. IS THERE A WAY? YES! If you have a plain old Unix shell account on a Unix system, such as a SunOS or Ultrix system, there are two ways around the problem: GUI Browsers that Talk to Unix Microsoft Windows users can run SlipKnot or ICOMM, special browsers which operate using programs that may already be installed on your shell account (covered in detail in the MS Windows browsers section). 
SLIP/PPP Emulators Anyone with dialup access to a Unix shell account can use The Internet Adapter (TIA) or SLiRP, two programs which provide a pseudo-SLIP connection. SLiRP is free. TIA is not free, but there is a free two-week trial period and it is inexpensive. You can learn more about TIA at . More information on SLiRP is available at . If you have a Macintosh, check out the Macintosh TIA Users' FAQ, , for additional help. "So what do I run on my machine at home?" Exactly the same software you would use for real SLIP; as far as your PC is concerned, it _is_ a SLIP connection. If you're unfamiliar with SLIP please check out a newsgroup relevant to your particular type of machine (Windows, Mac, or even Unix-based). _________________________________________________________________ _World Wide Web FAQ_ CAN I BROWSE HTML FILES LOCALLY WHEN I'M OFFLINE? If you do not use Microsoft Windows, the answer is usually "no problem!" Just use the "Open File" or equivalent option on the file menu of your web browser, instead of "Open Location" or "Open URL". If you use Microsoft Windows, and particularly if you use Netscape, this may be a problem. Some web browsers will refuse to run unless there is functioning Internet software running on the system. Netscape offers a solution to this problem in the release notes to version 1.1 of their product. Essentially, you can install an "empty" Internet interface (winsock.dll) that keeps Netscape happy. _________________________________________________________________ _World Wide Web FAQ_ HOW CAN I ACCESS THE WEB THROUGH A FIREWALL? A "proxy server" is a specialized HTTP server which (typically) runs on a firewall machine, providing access to the outside world for people inside the firewall. The CERN httpd can be configured to run as a proxy. Furthermore, it is able to perform caching of documents, resulting in faster response times. 
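For readers who script their own fetches, here is a rough sketch of what "going through the proxy" looks like from the client side. It uses Python's standard urllib.request module; the proxy address proxy.example.com:8080 and the helper names make_proxy_opener and fetch_via_proxy are made up for illustration, so substitute whatever host and port your network administrator gives you.

```python
import urllib.request

# Hypothetical proxy address; substitute the host and port your administrator gives you.
PROXY_URL = "http://proxy.example.com:8080/"


def make_proxy_opener(proxy_url=PROXY_URL):
    """Return an opener that sends every http: request to the given proxy."""
    handler = urllib.request.ProxyHandler({"http": proxy_url})
    return urllib.request.build_opener(handler)


def fetch_via_proxy(url, proxy_url=PROXY_URL):
    """Fetch url through the proxy and return the response body as bytes."""
    with make_proxy_opener(proxy_url).open(url, timeout=30) as response:
        return response.read()


opener = make_proxy_opener()
# Example call (needs a reachable proxy, so it is left commented out):
# body = fetch_via_proxy("http://www.w3.org/")
```

The client only needs to know the proxy's address; the proxy itself makes the connection to the outside world.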
If you cannot arrange to run a proxy server (definitely the recommended approach), read on: For information on using NCSA Mosaic from behind a firewall, please read the following. In general, browsers can be made useful behind firewalls through the use of a package called "SOCKS"; the source must be modified slightly and rebuilt to accommodate this. Whenever possible, work _with_ your network administrators to solve the problem, not against them. An excerpt from the NCSA Mosaic FAQ: NCSA Mosaic requires a direct internet connection to work, but some folks have put together a package that works behind firewalls. This is _completely unsupported_ by NCSA, but here is the latest announcement: _November 15, 1993:_ C&C Software Technology Center (CSTC) of NEC Systems Lab has made available a version of SOCKS, a package for running Internet clients from behind firewalls without breaching security requirements, that includes a suitably modified version of Mosaic for X 2.0. _Beware: such a version is not supported by NCSA; we can't help with questions or problems arising from the modifications made by others._ But, we encourage you to check it out if it's interesting to you. Questions and problem notifications can be sent to Ying-Da Lee (_ylee@syl.dl.nec.com_). _________________________________________________________________ _World Wide Web FAQ_ I'M RUNNING XMOSAIC. WHY CAN'T I GET EXTERNAL VIEWERS WORKING? Answer provided by Ronald E. Daniel (rdaniel@acl.lanl.gov): Mosaic only looks at the .mime.types file if it has no idea what the document's type is. This is actually a very rare situation. Essentially all servers now use the HTTP/1.0 protocol, which means that they tell Mosaic (or other browsers) what the document's MIME Content-type is. The servers use a file very much like Mosaic's .mime.types file to infer the Content-type from the filename's extension. It is pretty simple to find out if this really is the problem. 
Use telnet to talk to the server and find out if it is assigning a MIME type to the document in question. Here's an example, looking at the home page for my server. (idaknow: is my shell prompt)

    idaknow: telnet www.acl.lanl.gov 80      // Connect to the httpd server
    Trying 128.165.148.3 ...
    Connected to www.acl.lanl.gov.
    Escape character is '^]'.
    HEAD /Home.html HTTP/1.0                 // replace Home.html with your document
                                             // you supply the blank line
    HTTP/1.0 200 OK                          // the rest of this comes from the server
    Date: Wednesday, 25-May-94 19:18:11 GMT
    Server: NCSA/1.1
    MIME-version: 1.0
    Content-type: text/html                  // Here's the MIME Content-type
    Last-modified: Monday, 16-May-94 16:21:58 GMT
    Content-length: 1727

    Connection closed by foreign host.
    idaknow:

In the example above, requesting /Home.html returns the headers for http://www.acl.lanl.gov/Home.html . Normally servers will be configured to supply a Content-type of text/plain if they don't know what else to do. If this is the problem you are having, take a look at the TypesConfig documentation for NCSA's httpd. You can have the server look at the filename extension and supply the correct Content-type, then use your local .mailcap file to tell Mosaic what viewer to use to look at the document.

Russ Segal adds: The answer from Ronald Daniel is essentially correct, but it needs a small addendum. When starting Mosaic, you can specify a "fileProxy" which will fetch files for you:

    *fileProxy: http://socks/

If you do this, file: URLs are no longer strictly local accesses. So even if the URL is not http:, the proxy server must be upgraded as Mr. Daniel suggests.

_________________________________________________________________
_World Wide Web FAQ_
I HAVE A WINDOWS PC OR MACINTOSH. WHY CAN'T I ACCESS WAIS URLS?

This answer provided by Michael Grady (m-grady@uiuc.edu): The version of Mosaic for X has "wais client" code built into it.
This was relatively easy for the developers to do, because there was already a set of library routines for talking to WAIS available for Unix as "public domain" (freeWAIS). I don't think there is such a library of routines for PC/Windows or Mac, which would make it much more difficult for the Mosaic versions for Windows and the Mac to add "wais client" capability. Therefore, at least for now, neither the Windows nor the Mac version of Mosaic supports direct queries of a WAIS server (i.e., neither can act as a WAIS client itself).

_________________________________________________________________
_World Wide Web FAQ_
HOW DO I PRINT LEGIBLE PLAIN-ASCII VERSIONS OF HTML PAGES?

There are several ways. Most web browsers have a "save as ascii" option; the quality of the result varies. Lynx, in particular, being a text-based browser, does a credible job if you select the print option and choose "print to local file" instead of an actual printer. A product designed expressly for this purpose is HTMLCon (URL is ), a DOS command line application.

_________________________________________________________________
_World Wide Web FAQ_
HOW CAN I SAVE AN INLINE IMAGE TO DISK?

Here are three ways:

1. If you are using Netscape, just hold down the right mouse button (hold down the single mouse button for more than a second if using the Mac version) over the image. A menu will appear that includes the option of saving the image.

2. Turn on "load to local disk" in your browser, if it has such an option; then reload images. You'll be prompted for filenames instead of seeing the images on the screen. Be sure to shut it off when you're done with it.

3. Choose "view source" and browse through the HTML source; find the URL for the inline image of interest to you; copy and paste it into the "Open URL" window. This should load it into your image viewer instead, where you can save it and otherwise muck about with it.
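If hunting through the HTML source by eye (the third method) gets tedious, the search can be scripted. What follows is only a sketch, written in Python with its standard html.parser and urllib.parse modules; the names InlineImageFinder and inline_image_urls, the sample page, and the www.example.com address are all invented for illustration. It lists the full URL of every inline image in a page, which you can then paste into "Open URL".

```python
from html.parser import HTMLParser
from urllib.parse import urljoin


class InlineImageFinder(HTMLParser):
    """Collect the SRC attribute of every IMG tag in an HTML document."""

    def __init__(self, base_url):
        super().__init__()
        self.base_url = base_url
        self.urls = []

    def handle_starttag(self, tag, attrs):
        # HTMLParser hands us tag and attribute names in lower case.
        if tag == "img":
            for name, value in attrs:
                if name == "src" and value:
                    # Resolve relative references against the page's own URL.
                    self.urls.append(urljoin(self.base_url, value))


def inline_image_urls(html_source, base_url):
    """Return the absolute URLs of all inline images found in html_source."""
    finder = InlineImageFinder(base_url)
    finder.feed(html_source)
    return finder.urls


# Hypothetical page source and page address, for illustration only.
sample = '<H1>Demo</H1><IMG SRC="pics/logo.gif" ALT="logo"><IMG SRC="/icons/new.gif">'
for url in inline_image_urls(sample, "http://www.example.com/docs/index.html"):
    print(url)
```

Relative image references are resolved against the address of the page itself, which is the same thing your browser does when it loads the page.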
_________________________________________________________________
_World Wide Web FAQ_
HOW DO I SEND NEWSGROUP POSTS IN HTML TO MY WEB CLIENT?

How to do this depends greatly on your system; if you have a Mac or Windows system, the answer is completely different. But, as food for thought, here is a simple shell script I use on my Unix account to send posts from rn and related newsreaders to Lynx. Put this text in the file "readwebpost" and use the "chmod" command to make it executable, then put it somewhere in your path (such as your personal bin directory):

    #!/bin/sh
    echo \ > .article.html
    cat >> .article.html
    echo \ >> .article.html
    lynx .article.html < /dev/tty
    rm .article.html

Then add the following line to your .rnmac file (create it if you don't already have one):

    W |readwebpost %C

Now, when you press "W" while reading a post in rn, a message will be sent to Lynx, and the links enclosed in it will be live.

Larry W. Virden provides the following version which invokes Mosaic instead, and is also capable of communicating with an already-running copy of Mosaic instead of launching another. (You can use the same rn macro as above, invoking "goto-xm" instead of "readwebpost".) Read the comments for details on the assumptions made by the script.

    #! /bin/sh
    # goto-xm, by Joseph T. Buck
    # Modified heavily by Larry W. Virden
    # Script for use with newsreaders such as trn. Piping the article
    # through this command causes xmosaic to pop up, pointing to the
    # article. If an existing xmosaic (version 1.1 or later) exists,
    # the USR1 method will be used to cause it to point to the correct
    # article, otherwise a new one will be started.
    # assumptions: ps command works as is on SunOS 4.1.x, may need changes
    # on other platforms.
    URL=`/bin/grep '^Message-ID:' | /bin/sed -e 's/.*.*//'`
    if [ "X$URL" = "X" ]; then
        echo "USAGE: $0 [goto] [once] < USENET_msg" >&2
        exit 1
    fi
    pid=`ps -xc | egrep '[Mm]osaic' | awk 'NR == 1 {print $1}'`
    p=`which Mosaic`
    gfile=/tmp/Mosaic.$pid
    $p "$URL" &
    if [ "$#" -gt 0 ] ; then
        if [ "$1" = "goto" -o "$1" = "same" ] ; then
            shift
            echo "goto" > $gfile
        else
            echo "newwin" > $gfile
        fi
    else
        echo "newwin" > $gfile
    fi
    /bin/awk 'END { printf "'"$URL"'" }' > $gfile
    trap "echo signal encountered" 30
    kill -USR1 $pid
    exit 0

See also MosaicMail (URL is http://www.oac.uci.edu/indiv/ehood/mhonarc.doc.html ), a Perl script which pipes email and/or news to your current Mosaic session.

_________________________________________________________________
_World Wide Web FAQ_
HOW CAN I GET SOUND FROM THE PC SPEAKER WITH WINMOSAIC?

This piece of wisdom donated by Hunter Monroe: This section explains how to install sound on a PC which already has a working version of Mosaic for Microsoft Windows. Be warned in advance that the results may be poor. To get Mosaic to produce sound out of the PC speaker, first, you need a driver for the speaker. You can get the Microsoft speaker driver from the URL ftp://ftp.microsoft.com/Softlib/MSLFILES/SPEAK.EXE or by doing an Archie search to find it somewhere else. SPEAK.EXE is a self-extracting file. Copy the speak.exe file to a new directory, and then type "SPEAK" at the DOS prompt. Do not put the file SPEAKER.DRV in a separate directory from OEMSETUP.INF. Now, you need to install the driver. In Windows, from the Program Manager choose successively Main/Control Panel/Drivers/Add/Unlisted or updated drivers/(enter path of SPEAK.EXE)/PC Speaker. At this point some strange sounds come out as the driver is initialized. Change the settings to improve the sound quality on the various sounds: tada, chimes, etc. Click OK when you are finished and choose the Restart windows option.
Having installed the speaker driver, you will now get sounds whenever you start Windows, make a mistake, or exit Windows. If you do not want this, from the Main/Control Panel/Sounds menu, make sure there is no X next to "Enable System Sounds." Now, you need a sound player program that Mosaic can call to play sounds. NCSA unfortunately recommends WHAM, which does not work well with a PC speaker. Get the program WPLANY instead. You can find a copy nearby with an Archie search on the string "wplny"; the current version is WPLNY09B.ZIP. For details on archie and other basic issues related to FTP, please read the Usenet newsgroup news.announce.newusers. Move the zip file to a new directory, and use an unzip program like pkunzip to unzip it, producing the files WPLANY.EXE and WPLANY.DOC. Then edit the MOSAIC.INI file to remove the "REM" before the line "TYPE9=audio/basic". Then, you need lines in the section below that read something like:

    audio/basic="c:\wplany\wplany.exe %ls"
    audio/wav="c:\wplany\wplany.exe %ls"

where you have filled in the correct path for wplany.exe. The MOSAIC.INI file delivered with Mosaic may have NOTEPAD.EXE on the audio/basic line, but this will not work. Now, restart Mosaic, and you should be able to produce sounds. To check this, with Mosaic choose File/Local File/\WINDOWS\*.WAV and then try to play TADA.WAV. Then, you might try the Mosaic Demo document for some .AU sounds, but you are lucky if your speaker produces something you can understand.

_________________________________________________________________
_World Wide Web FAQ_
AMIGA SERVERS

AWS AWS is the first server written specifically for the Amiga. Documentation is available from , and the distribution can be downloaded by anonymous FTP from . NCSA NCSA's Unix server has been ported to the Amiga, and is bundled with the AMosaic browser; however, a web page about the port is no longer available.
_________________________________________________________________ _World Wide Web FAQ_ MACINTOSH SERVERS WebSTAR WebSTAR is an "industrial-strength" commercial World Wide Web server from StarNine, Inc. (URL is ). MacHTTP MacHTTP (URL is ) is a freely available web server for the Macintosh. There is also a Frequently Asked Questions posting dedicated to MacHTTP: Mac Common Lisp Server A server written in Mac Common Lisp (URL is ) is now available. The Mac Common Lisp server supports extension of the server with object-oriented Lisp code and is freely available, including source. _________________________________________________________________ _World Wide Web FAQ_ MSDOS AND NOVELL NETWARE SERVERS KA9Q KA9Q NOS (nos11c.exe) is an Internet server package for DOS that includes HTTP and Gopher servers. It can be obtained via anonymous FTP from one of the following sites: inorganic5.chem.ufl.edu or biochemistry.cwru.edu. GLACI-HTTPD GLACI-HTTPD is a NetWare Loadable Module which allows a Novell NetWare server to become a World Wide Web server (URL is http://www.glaci.com/info/glaci-httpd.html ). The Major BBS Galacticomm's Major BBS software now has an Internet Connectivity Option that adds web server capabilities (URL is ). _________________________________________________________________ _World Wide Web FAQ_ UNIX SERVERS NCSA httpd NCSA has released a server, known as the NCSA httpd; it is available at the URL ftp://ftp.ncsa.uiuc.edu/Web/httpd . EIT httpd EIT has created the Webmaster's Starter Kit, which installs their WWW server on your system via the web through a painless forms interface. Recommended for those unfamiliar with server installation. You can learn more about the starter kit and the EIT httpd at the starter kit site (URL is http://wsk.eit.com/wsk/doc/ ).
Apache httpd Apache is a powerful, reliable drop-in replacement for the NCSA httpd, currently available in beta test form: CERN httpd CERN's server is available for anonymous FTP from ftp.w3.org (URL is http://www.w3.org/hypertext/WWW/Daemon/Status.html ) and many other places. Use your local copy of archie to search for "www" in order to find a nearby site. Netscape's Netsite Servers Netscape Communications Corporation offers two server products, high-end Netscape Commerce Server (capable of secure transactions) and the less expensive Netscape Communications Server. Both products feature a more efficient replacement for CGI (common gateway interface) programming and are designed to be more efficient than traditional free-of-charge servers such as the NCSA and CERN http demons. Compuserve Internet Office Web Server Compuserve's Internet division (formerly Spry) offers the Internet Office Web Server, available for both Unix and Windows NT. The standard edition can be tried out for free. The professional edition includes editing tools and supports S-HTTP security and SQL database connectivity. GN Gopher/HTTP server The GN server is unique in that it can serve both WWW and Gopher clients (in their native modes). This is a good server for those migrating from Gopher to WWW, and includes some of the more powerful web server features as well (such as CGI scripts). See the URL http://hopf.math.nwu.edu/. Perl server There is also a server written in the Perl scripting language, called Plexus, for which documentation is available at the URL http://bsdi.com/server/doc/plexus.html . WN Server The WN Server, available at the URL http://hopf.math.nwu.edu/docs/manual.html , is designed with an emphasis on security and flexibility, and takes a different approach from the NCSA and CERN servers. It provides text searching facilities as a standard feature. 
Phttpd The Phttpd Server, available by anonymous FTP from ftp.lysator.liu.se in the directory pub/phttpd, is a multithreaded server for Sun's Solaris 2.X operating system which takes advantage of memory mapping and dynamic linking to achieve excellent performance. Open Market Web Servers Open Market offers two commercial products, WebServer and the Secure WebServer. The latter supports the Secure HTTP standard for secure transactions. Both are multithreaded for efficiency and emphasize strong logging features and access control (URL is ). Spinner Spinner is a free web server for Unix platforms which supports extensive server-side parsing of documents, completely avoids forking for non-CGI accesses, and supports multiple roots for multiple host names (URL is ). _________________________________________________________________ _World Wide Web FAQ_ VM/CMS SERVERS A VM/CMS web server is available (URL is ) for more information. _________________________________________________________________ _World Wide Web FAQ_ VMS SERVERS CERN HTTP for VMS A port of the CERN server to VMS. Available at the URL http://delonline.cern.ch/disk$user/duns/doc/vms/distribution.html . Region 6 Threaded HTTP Server A native VMS server which uses DECthreads(tm). This is a potentially major performance advantage because VMS has a high overhead for each process, which is a problem for the frequently-forking NCSA and CERN servers that began life under Unix. A multithreaded server avoids this overhead. Available at the URL http://kcgl1.eng.ohio-state.edu/www/doc/serverinfo.html . _________________________________________________________________ _World Wide Web FAQ_ MS WINDOWS, IBM OS/2 AND MS WINDOWS NT SERVERS HTTPS (Windows NT) HTTPS is a server for Windows NT systems, both Intel- and Alpha-based. It is available via anonymous FTP from emwac.ed.ac.uk in the directory pub/https (URL is ftp://emwac.ed.ac.uk/pub/https). (Be sure to download the version appropriate to your processor.)
You can read a detailed announcement at the FTP site, or by using the URL ftp://emwac.ed.ac.uk/pub/https/https.txt. A professional version is also available (URL is http://emwac.ed.ac.uk/html/internet_toolchest/https/prof.htm ). goserve for OS/2 goserve (URL is ) is a one-piece World Wide Web and Gopher server for OS/2. Designed for ease of installation. zbserver zbserver is a shareware server for Windows which supports both http and gopher access (URL is ). Purveyor From Process Software Corporation. For Windows NT. Based on the EMWAC source code, with enhancements (URL is ). Windows httpd The Windows httpd (URL is ) has most of the features of the Unix version, including scripts (which generate pages on the fly based on user input). Scripts can be implemented in Visual BASIC; they can also be implemented in Perl or any other language available for MSDOS. CGI DOS programs can be conveniently debugged using the CGI-DOS Perl library (URL is ). SerWeb A simple, effective server for Windows written by Gustavo Estrella. Available by anonymous ftp from winftp.cica.indiana.edu (or one of its mirror sites, such as nic.switch.ch), as the file serweb03.zip, in the directory /pub/pc/win3/winsock. There is also a Windows NT version of SerWeb, available by anonymous FTP from emwac.ed.ac.uk as /pub/serweb/serweb_i.zip. Chameleon Web Personal Server Included with the Chameleon TCP/IP software from Netmanage, Inc. Comments, anyone? WEB4HAM Another Windows-based server, available by anonymous FTP from ftp.informatik.uni-hamburg.de as /pub/net/winsock/web4ham.zip. OS2HTTPD An OS/2 server, written by Frankie Fan. See the home page (URL is ftp://ftp.netcom.com/pub/kf/kfan/overview.html ) for details, or fetch the package by anonymous FTP from ftp.netcom.com in the directory pub/kf/kfan.
Netscape's Netsite Servers Netscape Communications Corporation offers two server products, high-end Netscape Commerce Server (capable of secure transactions) and the less expensive Netscape Communications Server. Both products feature a more efficient replacement for CGI (common gateway interface) programming and are designed to be more efficient than traditional free-of-charge servers such as the NCSA and CERN http demons. Alibaba Alibaba is Computer Software Manufaktur's NT-based web server, which takes advantage of multithreading for best performance: WebSite WebSite (URL is ) is a Windows NT-based web server available from O'Reilly. WebSite offers a graphical, user-friendly front end to the server for easy file manipulation, and includes software to track down broken links. WebSite also runs under Windows 95. Compuserve Internet Office Web Server Compuserve's Internet division (formerly Spry) offers the Internet Office Web Server, available for both Unix and Windows NT. The standard edition can be tried out for free. The professional edition includes editing tools and supports S-HTTP security and SQL database connectivity. FolkWeb WWW Server FolkWeb is a Windows NT and 95 web server which takes advantage of threads and offers friendly GUI-based configuration. _________________________________________________________________ _World Wide Web FAQ_ HOW CAN TWO DIFFERENT HOME PAGES SHARE ONE PHYSICAL MACHINE? Dan Pritchett maintains a document detailing the process of running two or more servers on the same machine without end users being able to tell the difference (URL is ). _________________________________________________________________ _World Wide Web FAQ_ YEAH, BUT WHICH SERVER IS BEST? To find out which server is best for your needs, you will want to consult Paul Hoffman's Server Comparison Chart (URL is ). _________________________________________________________________ _World Wide Web FAQ_ HOW FAST DOES MY NET CONNECTION NEED TO BE? 
The following response to this very-frequently-asked-question was provided by Mike Meyer (mwm@contessa.phone.net). The answer is "It depends." What it depends on is what kind of things you want to provide on your server. Here are some rules of thumb to use when deciding what kind of connection you need for your server. The first rule of thumb is: _Don't worry about simultaneous access._ Unless you have a very large site, simultaneous access is not a problem. If you have a very large site, you need as much bandwidth as you can afford. There is a bit more about this below. The second rule of thumb is: _It should take at most 5 seconds to send a page._ The five second rule dates from command line days, when that was about how long people would wait before getting impatient with the system. It seems like a reasonable number to use now. Since external images/audio/etc. are somewhat exceptional, allow more time for them. If you think they should have the same restrictions as above, buy the bandwidth your site will need to do so. However, the rule of thumb for external images/audio/etc is: _It should take at most 30 seconds to send an external file._ Given these rules, it's pretty straightforward to work out how large an HTML page and external files can be. At least, it's easy after you simplify things by ignoring IP overhead on the line, compression on modem lines, and anything that's less than 10% of the total (or even a little bit more than 10%). The one simplification not to ignore is the multiple packet round-trips it takes to get data flowing through an HTTP channel. For modem lines, this is nearly a second for each HTTP connection, which is significant. For leased lines, it's more like .1 or .2 seconds, which is not significant. On a 14.4 line assumed to be sending 1.4K bytes of data/second, with a 1 second startup, you get 4 * 1.4 or 5.6K of HTML. 
If you want to include a single inline image, that's 2 seconds of startup, so you're down to 3 * 1.4 or 4.2K of HTML + image. This means smallish HTML pages, and simple inline images. For external files, you get 29 * 1.4 or about 40K, which is still a small image. If you have a 28.8 line, you get to double those figures; for a 9600 line, figure 2/3rds of that size. On a 56K leased line assumed to be sending 5K/second, you get 25K of HTML, or mixed HTML/data. For external images, it's 150K. That should cover any reasonable HTML document, and small to medium external files. An MPEG movie might be a bit much. With a T1 line assumed to be sending 150K/second, you get 750K of HTML, or 4.5 megabytes in an external file. Barring very large animations, this should be sufficient for anything you want to serve. More would be faster, but it also gets drastically more expensive. Given the above guidelines, let's look at simultaneous access again. Under the worst case conditions, you're using all of your line for HTML pages, each of which takes 5 seconds to send, so your server is sending 12 pages a minute, or 720 pages an hour, or about 17,000 pages a day (pages, not accesses; each inline image in a page generates an access, unless the client cached it). This makes you one of the busier sites on the web. While you'll have contention problems before you get to this point, anything but a modem connection will be sending most pages in a small fraction of five seconds, which should leave plenty of bandwidth with no contention. If you have access rates like this on a modem line, you should seriously consider upgrading your connection. The bottom line on simultaneous access is that the WWW server is more likely to have contention with other uses of the line than with itself. Since I don't know what else you use your line for, I can't factor it in. You'll have to consider that issue yourself. 
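The arithmetic behind these rules of thumb can be sketched in a few lines of Python. This is only an illustration, not part of Mike Meyer's answer: the function names (budget_kb, pages_per_day) are made up for this example, and the throughput and startup figures are simply the assumptions quoted above, with startup treated as zero on leased lines because the text calls it insignificant there.

```python
# Rules of thumb from the text: 5 seconds per page, 30 seconds per external file.
PAGE_LIMIT_S = 5.0
EXTERNAL_LIMIT_S = 30.0

# (label, assumed throughput in K bytes/second, assumed startup seconds per connection)
# These are the figures quoted in the text, not measurements.
LINES = [
    ("14.4 modem", 1.4, 1.0),
    ("56K leased", 5.0, 0.0),
    ("T1", 150.0, 0.0),
]


def budget_kb(rate_kb_per_s, time_limit_s, connections=1, startup_s=0.0):
    """K bytes that fit in time_limit_s after paying startup for each HTTP connection."""
    usable_s = max(time_limit_s - connections * startup_s, 0.0)
    return usable_s * rate_kb_per_s


def pages_per_day(page_time_s=PAGE_LIMIT_S):
    """Worst case: the line does nothing but send pages, one after another."""
    return int(24 * 60 * 60 / page_time_s)


if __name__ == "__main__":
    for label, rate, startup in LINES:
        page = budget_kb(rate, PAGE_LIMIT_S, 1, startup)
        page_with_image = budget_kb(rate, PAGE_LIMIT_S, 2, startup)
        external = budget_kb(rate, EXTERNAL_LIMIT_S, 1, startup)
        print("%-11s page %7.1fK  page+image %7.1fK  external %7.1fK"
              % (label, page, page_with_image, external))
    print("pages per day at one page every 5 seconds:", pages_per_day())
```

Each inline image is counted as one more connection, which is why a single image costs a modem user a full extra second of startup before any data arrives.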
_________________________________________________________________ _World Wide Web FAQ_ DO I HAVE TO APPROVE EVERY IMAGEMAP MY USERS CREATE? Not if you update to the latest and greatest imagemap software. The problem is that the NCSA web server imagemap program used to require a central configuration file. This restriction has been lifted in version 1.4 of the NCSA web server (read more at ). The CERN imagemap program never did have this restriction (consider ). Also consider Jutta Degener's "umap" ( ), a flexible alternative to the standard imagemap utilities. _________________________________________________________________ _World Wide Web FAQ_ CAN I SAFELY ALLOW MY USERS TO RUN THEIR OWN CGI SCRIPTS? CGI scripts are a very powerful facility, with some risks attached to them. In a Unix system, if CGI scripts run with the same user ID as the web server itself, poorly or maliciously written scripts can damage files or open security holes. There are two important steps that should be taken to correct this: 1. _NEVER_ run your web server as root; make sure it is configured to change to another user ID at startup time. (This is standard practice in all web server distributions, but administrators have been known to change it back to running as root anyway. Don't.) 2. Consider using a wrapper such as , user.c , or CGIwrap to ensure that each CGI script runs with the permissions and user ID of the user responsible for it. If proper precautions are taken, user CGI scripts can be reasonably safe. As always, dumb mistakes that open security holes for outsiders are more likely to be the cause of problems than actual malice on the part of your own users. _________________________________________________________________ _World Wide Web FAQ_ CAN I BUY SPACE ON AN EXISTING SERVER? Yes, you can. A list of sites offering WWW space for lease is available (at the URL http://union.ncsa.uiuc.edu/HyperNews/get/www/leasing.html ). 
_________________________________________________________________ _World Wide Web FAQ_ HOW CAN I KEEP ROBOTS OFF MY SERVER? Programs that automatically traverse the web can be quite useful, but have the potential to make a serious mess of things. Every so often someone will write a "depth-first" searching robot that brings servers to their knees. See the section on writing robots for details. Fortunately, most robots on the web follow a simple protocol by which you can keep them off your server if you wish, or keep them out of portions of your server which are robot traps (ie, they contain an infinite number of possible links). Read the document World Wide Web Robots, Wanderers and Spiders (URL is ) and learn about the emerging standards for exclusion of robots from areas in which they are not wanted. You can also read about existing robots there, including useful cataloging robots you probably do _not_ want to keep off your server. _________________________________________________________________ _World Wide Web FAQ_ HOW DO I PUBLICIZE MY WORK? There are several things you can do to publicize your new HTML server or other offering: * Post to comp.infosystems.www.announce. PLEASE READ THE CHARTER POSTING FIRST. In general, always read a newsgroup first to familiarize yourself before posting to it. * Submit it to Yahoo (URL is ), an impressive index of the web which expands its knowledge automatically but permits the direct submission of URLs as well. * Submit it to the NCSA What's New Page at the URL http://www.ncsa.uiuc.edu/SDG/Software/Mosaic/Docs/whats-new.html (see the page for details on how to submit your listing!). * Register your URL in the Lycos Database (URL is ). * Submit your URL to the maintainers of various catalogs, such as the WWW Virtual Library (at the URL http://www.w3.org/hypertext/DataSources/bySubject/Overview.html ) and the ALIWEB index (at the URL http://web.nexor.co.uk/aliweb/doc/aliweb.html ). 
* Read Gareth Rees' guide to publishing on the World Wide Web (URL is http://www.cl.cam.ac.uk/users/gdr11/publish.html ). * Consult Pete Page's How to Announce your New Web Site (URL is ). _________________________________________________________________ _World Wide Web FAQ_ HOW CAN I RESTRICT AND CONTROL ACCESS TO MY SERVER? All major servers have features that allow you to limit access to particular sites, and many clients have authentication features that allow you to identify specific users. An overview of this topic is available from the w3 Organization web server (URL is ). There is also a tutorial on security and user authentication with the NCSA server and Mosaic available, written by Marc Andreessen (URL is ). See your server documentation for further information. _________________________________________________________________ _World Wide Web FAQ_ HOW CAN I KEEP STATISTICS ABOUT MY WEB SERVER? There are several tools which can generate statistics about your web server. Combined Log Handling System The Combined Log Handling System is a log analyzer written in Perl which is able to read the logs of many different server packages, including ftp, gopher, several web server flavors, archie, and others. The system converts log entries to a single format and provides summary data (URL is ). getstats getstats is a versatile log analyzer, written in C, which provides reports for various time periods with a high degree of flexibility. Add-on packages have been written to generate reports in HTML and also to generate graphs. You can access the getstats home page for more information (URL is http://www.eit.com/software/getstats/getstats.html ), or obtain the package by anonymous FTP from ftp.eit.com in the directory /pub/web.software/getstats. WebStat WebStat is a package written in the language Python which supplies statistics on usage by domain, country, etc., with daily, weekly, monthly and annual reports available. You will need Python in order to use it. 
See the WebStat home page (URL is http://www.pegasus.esprit.ec.org/people/sijben/statistics/advertisment.html ) for details, or obtain Python from ftp.cwi.nl in the directory /pub/python and WebStat from ftp.pegasus.esprit.ec.org in the directory /pub/misc. Wusage Wusage, which I wrote, is a C program which generates simple weekly reports in HTML, with inline image graphs displaying server growth and the distribution of accesses by continent. You can also exclude irrelevant accesses (inline images, local machines, etc.) from the results. Read the Wusage home page (URL is http://siva.cshl.org/wusage.html ) for more information, or obtain Wusage by anonymous FTP from isis.cshl.org in the directory pub/wusage. wwwstat wwwstat is a full-featured log analyzer written in the language Perl. (See the newsgroup comp.lang.perl.misc for more information about the language.) See the wwwstat home page (URL is http://www.ics.uci.edu/WebSoft/wwwstat/) for more information, or obtain the package by anonymous FTP from liege.ics.uci.edu in the directory /pub/arcadia/wwwstat. See also gwstat (URL is http://dis.cs.umass.edu/stats/gwstat.html ), a package which produces GIF graphs from the output of wwwstat. bert Bert is an acronym for Browser-log Extraction and Reporting Tool. It reads the agent_log and reports which browsers people have been using to access your site. You can access the bert home page for more information (URL is ). Quickstats Quickstats is a straightforward log analysis package, oriented toward simple queries such as the popularity of a particular page. Quickstats can also ignore specific sites, among other options. Check out the QuickStats home page: ErrorChk Unlike most log statistics programs, ErrorChk analyzes and reports on the contents of the error log created by the NCSA server. This is useful as a means of diagnosing server problems. 
(URL is ) Snowhare's Log Analysis Tools Snowhare (Benjamin Franz) has made a suite of log analysis tools written in Perl available at which include graphical reports. analog Analog is a server log analysis package which emphasizes simplicity of installation, speed and attractive results. See for more information. _________________________________________________________________ _World Wide Web FAQ_ HOW CAN I SERVE [WORD DOCUMENTS, EXCEL SPREADSHEETS, DOUGHNUTS]? In order to deliver documents of new and different types from your server, you need to configure the correct "MIME type" for each type of document, and use the proper extension when naming the file on the server. If the document type is highly unusual, you will also need to see to it that users know what MIME type to configure their browsers for, and what application to launch for that MIME type. More information on this subject is available in Ken Jenks' file format recommendations for web servers, . _________________________________________________________________ _World Wide Web FAQ_ PRODUCING HTML DOCUMENTS HTML is the simple markup system used to create hypertext documents. HTML is not intended to be a comprehensive page-layout system. Instead, HTML aims to let you describe the _structure_ of your document by indicating headings, emphasis, links to other documents and so forth. The more you work with HTML rather than against it, the happier you'll be. You can include images and other multimedia objects in your documents, but it should be remembered that not all web users have graphical clients, and many web users voluntarily turn graphics _off_ to save downloading time! If you try to spite such users, you will only lose readers (and customers). You can in fact specify a great deal about the appearance of your document in the latest web browsers. 
There is no harm in taking advantage of these features, but as a rule of thumb, always make sure your document looks good in a text-based browser such as Lynx as well as in the graphical browser of your dreams. This is more than a simple matter of taste. Keep in mind that not all users can see! There are three ways to produce HTML documents: writing them yourself, which is not a very difficult skill to acquire, using an HTML editor, which assists in doing the above, and converting documents in other formats to HTML. The following three sections cover these possibilities in sequence: * Writing HTML yourself * HTML editing tools * Conversion tools _________________________________________________________________ _World Wide Web FAQ_ WRITING HTML DOCUMENTS YOURSELF You can write an HTML document with any text editor. Try the "source" button of your browser (or "save as" HTML) to look at the HTML for a page you find particularly interesting. The odds are that it will be a great deal simpler than you would expect. If you're used to marking up text in any way (even red-pencilling it), HTML should be rather intuitive. A beginner's guide to HTML is available at the URL http://www.ncsa.uiuc.edu/General/Internet/WWW/HTMLPrimer.html . You can also find a compressed Postscript version (at the URL ftp://ftp.ncsa.uiuc.edu/ncsapubs/WWW/HTMLPrimer.ps.Z). (Since the latter two are FTP URLs, you can fetch them by hand using FTP if you do not yet have a web browser.) There is also an HTML primer by Nathan Torkington at the URL http://www.vuw.ac.nz/who/Nathan.Torkington/ideas/www-html.html . _________________________________________________________________ _World Wide Web FAQ_ HTML EDITORS Some editors are WYSIWYG (What You See Is What You Get), or close to it; others simply assist you in writing HTML by plugging in the desired markup tags for you from a menu. The latter are surprisingly useful, and the former surprisingly limited. 
As a rule of thumb, if you are keenly interested in using the very latest new HTML feature, you will probably be disappointed with WYSIWYG editors. Some WYSIWYG editors do support entry of unfamiliar tags, however. A few can even display them in the color or style of your choice. This document covers editors for the following systems: * HTML editors for the Mac * HTML editors for Microsoft Windows * HTML editors for Unix (non-graphical) * HTML editors for the X Window System * Miscellaneous editors HTML Editors for the Mac HTML Editor A near-WYSIWYG package (URL is ). A stand-alone program. ANT_HTML ANT_HTML is a Word for the Macintosh template designed to convert Word documents into HTML documents in a WYSIWYG environment. It includes a demo version of the ANT_PLUS utility, which converts HTML files to WYSIWYG. ANT_PLUS also converts HTML files to ASCII, RTF, or any other format possible in Word. At the time of this writing it was scheduled to have been released on the Macintosh (it has long been available for Windows). Contact jswift@freenet.fsu.edu for more information. BBEdit HTML extensions This package of extensions allows the BBEdit and BBEdit Lite text editors for the Macintosh to conveniently edit HTML documents. (URL is .) You can also obtain the extensions package by anonymous ftp from sumex-aim.stanford.edu as info-mac/bbedit-html-ext-b3.hqx. Also see below. BBEditTools There is an alternative BBEdit extension package available as well (URL is ). It is available by FTP from ftp.york.ac.uk in the directory /pub/users/ld11/BBEdit_HTML_Tools.sea.hqx. SoftQuad HoTMetaL SoftQuad's HoTMetaL is a WYSIWYG HTML editor designed from the ground up to edit HTML. Unlike HTML modes for existing word processors, every aspect of HoTMetaL reflects this purpose. html-helper-mode for EMACS Users of the EMACS editor will want to consider html-helper-mode, an EMACS "mode" for HTML editing (see ). 
HTML Editors for Microsoft Windows Internet Assistant Microsoft has released Internet Assistant, a Word for Windows template which can edit HTML in a WYSIWYG manner, including the capability to load existing HTML documents. It also includes rudimentary browsing capabilities, sufficient to assist in editing (URL is ). ANT_HTML ANT_HTML is a Word template for both Windows (URL is ) and the Macintosh (URL is ) designed to convert Word documents into HTML documents in a WYSIWYG environment. It includes a demo version of the ANT_PLUS utility, which converts HTML files for importation and further editing. ANT_PLUS also converts HTML files to ASCII, RTF, or any other format possible in Word 6.0. Contact jswift@freenet.fsu.edu for more information. Quarterdeck WebAuthor Yet another commercial Word for Windows HTML editing template is available from Quarterdeck (URL is ) and is rumored to be superior to Internet Assistant. HTML Assistant A non-WYSIWYG editor called HTML Assistant is available, with features to assist in the rapid creation of HTML documents. A good choice for experienced HTML authors wishing to save keyboarding time. Available by anonymous FTP from ftp.cs.dal.ca in the directory /htmlasst/. Read the README.1ST file in this directory for information on which files to download. See also: Live Markup ( ) is a WYSIWYG HTML editor for Windows which insulates the user completely from HTML. Excel 5.0 to HTML Table Creator Most HTML editing facilities leave out table-editing capabilities. Fill that gap with Jordan Evans' Excel 5.0 to HTML Table Converter (URL is ). WEB Wizard For beginners in search of a quick and easy way to build a home page, consider WEB Wizard (URL is ), a simple package which prepares a home page after a question-and-answer session with the user. 16-bit and 32-bit Windows versions are available. HTML Writer A simple, useful non-WYSIWYG HTML editor that cooperates closely with most web browsers is HTML Writer, . "Donationware." 
SoftQuad HoTMetaL SoftQuad's HoTMetaL is a WYSIWYG HTML editor designed from the ground up to edit HTML. Unlike HTML modes for existing word processors, every aspect of HoTMetaL reflects this purpose. WebEdit WebEdit is a non-WYSIWYG editor (it does include a WYSIWYG editor for HTML 3.0 tables). Spell-checking is standard, and support is claimed for all HTML 3.0 features. See: Emissary Wollongong's Emissary is a complete Internet software suite which includes WYSIWYG HTML editing features (see ). html-helper-mode for EMACS Users of the EMACS editor will want to consider html-helper-mode, an EMACS "mode" for HTML editing (see ). HTML Editors for Unix (non-graphical) html-helper-mode for EMACS Users of the EMACS editor will want to consider html-helper-mode, an EMACS "mode" for HTML editing (see ). HTML Editors for the X Window System TkWWW (URL is ) supports WYSIWYG HTML editing; and since it's also a browser, you can try out links immediately after creating them. Phoenix (URL is http://www.bsd.uchicago.edu/ftp/pub/phoenix/README.html ) A fully WYSIWYG HTML editor which insulates the user from direct control of the HTML tags. Available by anonymous FTP from www.bsd.uchicago.edu in the pub/phoenix subdirectory. ASHE A WYSIWYG HTML editor which takes advantage of the NCSA Mosaic HTML "widget" (URL is ). htmltext htmltext supports WYSIWYG HTML editing. More information is available at the URL . html-helper-mode for EMACS Users of the EMACS editor will want to consider html-helper-mode, an EMACS "mode" for HTML editing (see ). WebAuthor A fully WYSIWYG commercial HTML editing product from Silicon Graphics (URL is ). SoftQuad HoTMetaL SoftQuad's HoTMetaL is a WYSIWYG HTML editor designed from the ground up to edit HTML. Unlike HTML modes for existing word processors, every aspect of HoTMetaL reflects this purpose. Miscellaneous editors html-helper-mode for EMACS Users of the EMACS editor will want to consider html-helper-mode, an EMACS "mode" for HTML editing (see ). 
HTML DTD Another option, if you have an SGML editor, is to use it with the HTML DTD (URL is ). NCSA's List of Filters and Editors See for another list of available HTML editing products. _________________________________________________________________ _World Wide Web FAQ_ CONVERTING OTHER FORMATS TO HTML There is a collection of filters for converting your existing documents (in TeX and other non-HTML formats) into HTML automatically, including filters that can allow more or less WYSIWYG editing using various word processors: Rich Brandwein and Mike Sendall's List (URL is http://www.w3.org/hypertext/WWW/Tools/Filters.html ). (Note that this URL contains uppercase and lowercase letters; certain operating systems such as VMS require you to quote mixed-case URLs when launching a browser from the command line. This is NOT a bug in the browser.) _________________________________________________________________ _World Wide Web FAQ_ CHECKING YOUR HTML FOR ERRORS Tools to validate your HTML documents (check them for errors) are available. There is a form at the URL http://www.hal.com/~markg/WebTechs/validation-form.html which will check HTML documents for errors according to the latest specification; note that you are encouraged to set up the program on your own system if you make heavy use of the form. There is also a tool which will check your documents for links to nonexistent resources, such as pages that have moved (URL is http://wsk.eit.com/wsk/dist/doc/admin/webtest/verify_links.html ). Also try weblint (URL is http://www.khoros.unm.edu/staff/neilb/weblint.html ), a Perl script that checks your HTML for errors; you can even try it out over the web through an HTML form. The script is available by anonymous FTP from ftp.khoros.unm.edu in the directory pub/perl/www. 
Another such tool is htmlchek (URL is: http://uts.cc.utexas.edu/~churchh/htmlchek.html ), which checks HTML documents for errors, creates a cross-reference, automatically expands entities (such as European characters) to their proper HTML form, and performs other useful services. htmlchek is available by anonymous FTP from ftp.cs.buffalo.edu in the directory pub/htmlchek. lvrfy is a simple, Unix-based link-checking program which checks your pages for broken links (URL is ). Checker, at , is another useful broken-link finder; binaries are available for numerous systems. _________________________________________________________________ _World Wide Web FAQ_ HOW CAN I "INCLUDE" ONE HTML DOCUMENT IN ANOTHER? Often HTML authors have a copyright notice, logo or other piece of HTML which needs to be included on many different pages. Doing this by hand is, obviously, painful. One might think there would be an "include" tag, much like the tag used for inline images, to include one document in another. But this has several problems, one of which is that it would require opening a second connection to the server. This is very inefficient (translation: SLOW for your readers). "So what can I do about it?" The most common solution is the "server-side include" mechanism. The NCSA web server can be configured to recognize documents ending in ".shtml" instead of ".html" as documents that it should scan for server-side include commands referencing other documents or scripts. For details, see the NCSA server documentation. _________________________________________________________________ _World Wide Web FAQ_ HOW CAN I CREATE A CUSTOM BACKGROUND AND SET THE TEXT COLORS? The capability to do this was introduced by Netscape in version 1.1 of that product. By now, many web browsers support it. _Please note:_ if your page is difficult to read, people will not read it. Please use the background attributes tastefully unless it is your intention to alienate your readership. 
A separate FAQ on the subject is maintained by Mark Koenen. Consult that document for more information. _________________________________________________________________ _World Wide Web FAQ_ HOW DO I GENERATE WEB PAGES FROM A PROGRAM OR DATABASE? Most web servers support one variation or another of a standard for adding your own programs to the web server. The standard is called CGI (Common Gateway Interface). Marc Hedlund has written a FAQ on CGI programming (URL is ) which makes a good introduction to the subject. The standard itself can be found at NCSA (URL is ). For tips on overcoming common CGI problems, consult the CGI problems section and the section on granting CGI access to users. _________________________________________________________________ _World Wide Web FAQ_ HOW CAN I IDENTIFY THE USER WHO IS ACCESSING MY CGI SCRIPT? Five important environment variables are available to your CGI script to help in identifying the end user. HTTP_FROM This environment variable is, theoretically, set to the email address of the user. However, many browsers do not set it at all, and most browsers that do support it allow the user to set any value for this variable. As such, it is recommended that it be used only as a default for the reply email address in an email form. REMOTE_USER This variable is only set if secure authentication was used to access the script. The AUTH_TYPE variable can be checked to determine what form of secure authentication was used. REMOTE_USER will then contain the name the user authenticated under. REMOTE_IDENT This variable is set if the server has contacted an IDENTD server on the client machine. This is a slow operation, usually turned off in most servers, and there is no way to ensure that the client machine will respond honestly to the query, if it responds at all. 
REMOTE_HOST This variable will not identify the user specifically, but does provide information about the site the user has connected from, if the hostname was retrieved by the server. In the absence of any certainty regarding the user's precise identity, making decisions based on a list of trusted addresses is sometimes an adequate workaround. This variable is not set if the server failed to look up the host name or skipped the lookup in the interest of speed; see REMOTE_ADDR below. REMOTE_ADDR This variable will not identify the user specifically, but does provide information about the site the user has connected from. REMOTE_ADDR will contain the dotted-decimal IP address of the client. In the absence of any certainty regarding the user's precise identity, making decisions based on a list of trusted addresses is sometimes an adequate workaround. This variable is always set, unlike REMOTE_HOST, above. _________________________________________________________________ _World Wide Web FAQ_ MY CGI SCRIPTS DON'T WORK. HOW CAN I DEBUG THEM? Several common causes are described here. Note that every web server is different; your mileage will almost certainly vary. In particular, Windows and Macintosh servers differ drastically from Unix servers. See your server's documentation. The Server Must Recognize Your Program Simply linking from your page to an executable program or script won't cause it to be run by the server. There are two common arrangements: either files in directories specially designated by the server administrator are executed as CGI scripts, or files with a special extension (such as .cgi) are executed as CGI scripts. These are just two possible ways your server might be configured. Many sites don't allow users to run CGI scripts at all. _Consult your web server's administrator._ Always Output a MIME Type Every CGI script must output a _MIME type_ indicating what kind of document it is producing. 
If your script outputs an HTML page, the correct format is:

    Content-type: text/html

followed by _two_ line feeds (ascii 10 decimal). _After_ the MIME type, output the desired HTML. Always Flush Output On many systems, unexpected problems can result when a CGI script outputs a MIME type, then executes another program to generate output. To prevent such problems, flush standard output before executing other programs. If your script is written in C, the proper code is usually:

    fflush(stdout);

Permissions and Paths: Why Can't My Script Access My Files? CGI programs typically execute with a current directory and user ID that differ from your personal home directory and user ID. When you write CGI programs, make sure any files accessed are accessed by absolute path (beginning from the root of the file system). Also, users of multiuser systems such as Unix may have to grant all users read access or even write access to data files using the chmod command. This is not an ideal situation. Better servers run your CGI programs using your user ID. Talk to your admin if you have difficulties in this area. When One Browser Works and Another Doesn't Some browsers are tolerant of incorrect Content-type headers, as well as of null characters in text/html or text/plain output. Make sure your output is strictly correct; it helps to check the script with Netscape, Mosaic and Lynx. _________________________________________________________________ _World Wide Web FAQ_ HOW CAN I MAKE SURE MY CGI-GENERATED PAGE IS NOT CACHED BY THE CLIENT? If your CGI-generated page is intended to produce completely different content on each access, it is important to convince the web client _not_ to display a cached copy the next time the user accesses it. One workaround is to make sure that all links the CGI program generates to itself contain a unique, random piece of information which is then ignored by the program when it arrives as part of the PATH_INFO environment variable. 
But this is not ideal, since the user will still see the same output again upon returning to a bookmark. However, consider the following alternatives: Some browsers support the Pragma: no-cache header. In this case, the following output at the beginning of your CGI program will specify both the content type and the fact that the page should never be cached:

    Content-type: text/html
    Pragma: no-cache

Note the two line feeds at the end, always required before the beginning of the actual document. Alternatively, if the page is "good" for some fixed amount of time, the "Expires:" HTTP header can be used to specify the time after which the page must be fetched again. _Important:_ The Greenwich Mean Time (GMT) must be specified, not the local time. _________________________________________________________________ _World Wide Web FAQ_ HOW CAN USERS SEND ME COMMENTS AND/OR EMAIL? There are two ways: Using a mailto: URL You can simply create a link which looks like this: Send Me Mail This works great for browsers that support the mailto: URL. Perhaps 80% of web users will be able to use such a link. But not all browsers support it. Installing a comment form If you have access to the server's configuration files, or if your server administrator permits users to create their own CGI scripts, you can create a form which sends mail to you from any browser that supports forms. A really flexible package for this is the mit-dcns-cgi package (URL is ). I've written a simple email forms package (URL is ), which does it in ANSI C. There is also a package written in Perl, known as the WWW Mailto Gateway (URL is ). GetComments (URL is ) is a more general package, also written in Perl, which can do many different things in response to a form submission. Tcl programmers may wish to try J.M. Ivler's TCL mail forms package . If you want to learn how these forms actually work, see the entry on CGI scripts. 
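To give a rough idea of what a comment-form script does behind the scenes, here is a minimal sketch in Python. It is not taken from any of the packages listed above. The form field names ("from" and "comments"), the recipient address and the function names are all hypothetical, and the sketch only builds the mail message and the CGI reply; actually handing the message to your system's mailer is left out, since that step differs from site to site.

```python
import html
import os
import sys
from email.message import EmailMessage
from urllib.parse import parse_qs

RECIPIENT = "webmaster@example.org"  # hypothetical address, for illustration only


def build_comment_mail(form_body, recipient=RECIPIENT):
    """Turn a URL-encoded form submission into a mail message (not sent here)."""
    fields = parse_qs(form_body, keep_blank_values=True)
    sender = fields.get("from", [""])[0].strip()      # hypothetical field name
    comments = fields.get("comments", [""])[0]        # hypothetical field name
    msg = EmailMessage()
    msg["To"] = recipient
    msg["Subject"] = "Web page comment"
    # The address is whatever the user typed, so treat it only as a reply hint,
    # and drop it if it contains line breaks that could forge extra headers.
    if sender and "\n" not in sender and "\r" not in sender:
        msg["Reply-To"] = sender
    msg.set_content(comments)
    return msg


def cgi_reply(text):
    """CGI output: the MIME type, a blank line, then the HTML itself."""
    body = "<HTML><BODY><P>" + html.escape(text) + "</P></BODY></HTML>\n"
    return "Content-type: text/html\n\n" + body


if __name__ == "__main__":
    # A form submitted with METHOD=POST arrives on standard input;
    # CONTENT_LENGTH says how many bytes to read.
    length = int(os.environ.get("CONTENT_LENGTH") or 0)
    message = build_comment_mail(sys.stdin.read(length))
    # A real script would pass `message` to the local mailer at this point.
    sys.stdout.write(cgi_reply("Thanks for your comments."))
```

Note that the reply starts with the Content-type line and a blank line, exactly as any other CGI script must.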
_________________________________________________________________
_World Wide Web FAQ_

WHERE CAN I LEARN HOW TO CREATE FILL-OUT FORMS?

Writing an HTML form is easy, but the form doesn't accomplish anything until you write a CGI program to interpret the results on the server side! For more information, see the section on CGI scripts. See the section on email forms for a simple solution to the most commonly desired form.

_________________________________________________________________
_World Wide Web FAQ_

HOW CAN I CREATE DECENT-LOOKING TABLES AND STOP USING <PRE>...</PRE>?

Tables are a standard feature in HTML Level 3, a new version of HTML. Unfortunately, not all browsers implement them, although they are supported by the latest versions of Netscape, NCSA Mosaic, and Viola.

There is a way to use HTML Level 3 tables while writing your pages and convert them automatically to HTML 2.0, allowing you to design proper tables and install those pages directly when table support arrives in whatever clients your users prefer. You can do this using the html+tables package, by Brooks Cutter (bcutter@paradyne.com), which is available for anonymous ftp from sunsite.unc.edu in the directory pub/packages/infosystems/WWW/tools/html+tables.shar. This package requires the scripting language Perl, which is primarily used on Unix systems but is also available for other systems (such as MSDOS machines). html+tables accepts HTML Level 3 and outputs HTML using the <PRE>...</PRE>
construct to represent tables, allowing you to write HTML Level 3 now, knowing that it will look better when clients are ready for it. (This is less of an issue now that table support is becoming widespread in better browsers.) _________________________________________________________________ _World Wide Web FAQ_ HOW CAN I USE INLINE IMAGES WITHOUT ALIENATING MY USERS? If you pay any attention to comments from users of your web pages, you will quickly learn that 500K GIFs are only pretty to the four or five users who have a personal T1 line. I'm exaggerating, but not all that much. It's astonishing how many web site producers have never tested their site through one of the 14.4kbps modems (that's only 1600 bytes per second on a good day, remember) that the _actual customer_ is using. But inline images can be useful, provocative and amusing. What can be done to make them available to those who can wait for them and unobtrusive to those who can't? 1. _ALWAYS_ Provide alternatives to imagemaps Even users who run Netscape often turn off image loading or don't want to wait long enough for an interlaced GIF to become recognizable on their screen in order to navigate your site. Always provide a set of text-based links to the same destinations. 2. Keep image file sizes modest For ways to make your images download faster _without_ throwing away image quality, see the guidelines maintained by the Bandwidth Conservation Society (URL is ). 3. Provide a text-only page If you follow the guidelines above, you may not need to provide a text-only version of your page, but if you insist on having an image-heavy page, provide a plaintext page as well. Please consider the needs of blind users as well as those with limited bandwidth, and keep in mind that nearly _all_ your users are in the latter category and will be for several years yet! _________________________________________________________________ _World Wide Web FAQ_ HOW CAN I DISTRIBUTE AUDIO THROUGH THE WEB? 
Not all web browsers have audio support built in, but nearly all can launch external "viewers" to handle audio. These player programs are widely available as freeware or shareware for most architectures (or come standard with your operating system). Audio is a particularly thorny case owing to the need to download the entire audio program before it can be heard. Alternatives to this delay are beginning to appear. I am openly soliciting URLs for other WWW-related audio products.

RealAudio

By Progressive Networks (URL is ). The RealAudio player can communicate with a specialized RealAudio server in order to play back audio as it is downloaded, eliminating download delays even over long distances and/or 14.4kbps modems. _Disclaimer:_ I used to work for PN.

_________________________________________________________________
_World Wide Web FAQ_

HOW CAN I GENERATE GIFS ON THE FLY FROM MY CGI SCRIPTS?

If you want to generate GIF images on the fly as part of your application, examine the gd library (URL is: http://siva.cshl.org/gd/gd.html ).

_Hint:_ your HTML page and your inline images are separate documents with separate URLs. Generate them in response to separate requests! (Yes, there are tricks to speed this up, but be careful not to break inline images on HTML pages you didn't write that refer to your gd-generated image.)

Adaptations of gd are available for Tcl, Perl, and other languages. See the gd page, listed above, for more information. Perl users may also be interested in

_________________________________________________________________
_World Wide Web FAQ_

HOW CAN I CREATE HIDDEN FIELDS IN FORMS (KEEPING STATE)?

Use INPUT TYPE=hidden. An example:

By now, most browsers can handle the hidden type, but understand that some browsers will fail to hide the field (and probably confuse the user). Note that "hidden" doesn't mean "secret"; the user can always click on "view source".
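As an illustration of keeping state this way, the following Python sketch generates a form whose hidden fields carry values from one request to the next. The helper names hidden_field and render_form, the action URL, and the field names are all hypothetical. The one point that matters is escaping each value before it goes into the VALUE attribute.

```python
from html import escape


def hidden_field(name, value):
    """Return an INPUT TYPE=hidden tag carrying one piece of state."""
    # Escape quotes and angle brackets so the value cannot end the attribute.
    return '<INPUT TYPE="hidden" NAME="%s" VALUE="%s">' % (
        escape(name, quote=True),
        escape(value, quote=True),
    )


def render_form(action, state):
    """Build a form that sends every item in `state` back on submission."""
    lines = ['<FORM METHOD="POST" ACTION="%s">' % escape(action, quote=True)]
    for name, value in state.items():
        lines.append(hidden_field(name, value))
    lines.append('<INPUT TYPE="submit" VALUE="Continue">')
    lines.append("</FORM>")
    return "\n".join(lines)
```

For example, render_form("/cgi-bin/order", {"step": "2"}) produces a form that posts step=2 back to the script along with whatever the user enters. Because the user can read and alter hidden values, the receiving script should validate them like any other input.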
_________________________________________________________________
_World Wide Web FAQ_

WHAT IS HTML LEVEL 3 AND WHERE CAN I LEARN MORE ABOUT IT?

HTML Level 3, formerly known as HTML+, is an enhanced version of HTML designed to address some of the limitations of HTML Level 2. HTML Level 3 supports true tables, right-justified text, centered text, line breaks that do not double space, and many other desired features.

However, most clients support only a handful of HTML Level 3 features at the time of this writing. The most commonly implemented major feature is table support. If you have access to a Unix system with the X Window System installed, you can try out many features of HTML Level 3 using the experimental Arena browser.

You can access information about new developments in HTML at the CERN server (at the URL http://www.w3.org/hypertext/WWW/MarkUp/MarkUp.html ).

(HTML Level 1 is the original version. HTML Level 2 is essentially the same, but with the addition of forms.)

_________________________________________________________________
_World Wide Web FAQ_

HOW DO I COMMENT AN HTML DOCUMENT?

Place <!-- before the comment text and --> after it. Note that comments do not nest, and the sequence "--" may not appear inside a comment except as part of the closing --> tag.

You should _not_ try to use this to "comment out" HTML that would otherwise be shown to the user, since some browsers (notably Mosaic) will still pay attention to tags inside the comment and close it prematurely.

_Thanks to Joe English for clearing up this issue._

_________________________________________________________________
_World Wide Web FAQ_

HOW DO I SET UP A CLICKABLE IMAGE MAP?

There are really two issues here: how to indicate in HTML that you want an image to be clickable, and how to configure your server to do something with the clicks returned by Mosaic, Chimera, and other clients capable of delivering them.

You can read about image maps and the NCSA server (URL is ). Also see Joseph Walker's collection of imagemap resources (URL is ).
Using imagemaps requires that you create a map file; you can do this by hand or with a WYSIWYG tool.

_VERY IMPORTANT:_ Creating imagemaps requires a real web server (not an FTP server) and a cooperative web server administrator. _It is not usually as simple as wrapping a link around an IMG SRC tag and adding the ISMAP directive;_ the server must also be told about the map file, and the way to accomplish this varies from server to server. So _read your server documentation,_ and don't waste time making maps before making sure you have the necessary tools to deliver them.

_Addendum:_ there are now web servers that actually do make it that simple; yours may be one of them. But if you have difficulties, TALK TO YOUR ADMIN AND READ YOUR SERVER MANUAL FIRST (really!) before posting.

Map THIS

Map THIS (URL is: http://galadriel.ecaetc.ohio-state.edu/tc/mt) is a feature-laden WYSIWYG imagemap editing tool for Microsoft Windows 32-bit environments (Win32s, Windows 95 or Windows NT required; Win32s is available from Microsoft's FTP site, ftp.microsoft.com, among other places). Free.

Web Hotspots

Web Hotspots (URL is ) is a feature-rich imagemap editor for all Windows systems, supporting zoom, advanced shape manipulation, and a multiple-document interface. Shareware.

HoTTmapP

Another WYSIWYG imagemap editor for Windows. Features permanent associations between images and map files for convenient reopening and manipulation of existing shapes. The capability to merge data from multiple MAPs is also provided. See for more information.

Mapedit

Mapedit (URL is ) is a simple WYSIWYG imagemap editing tool for both Microsoft Windows and the X Window System. Shareware.

MapMaker

For users of John Bradley's _xv_ image display software for the X Window System, Mapmaker can turn the miniature images created by xv's Visual Schnauzer into an imagemap. This is useful if you would like to make an entire directory of images available (but note that you should also make textual links to allow those with text-based browsers to download the images for external viewing). (URL is: http://icg.stwing.upenn.edu:80/~mengwong/mapmaker.html )

WebMap

On the Macintosh, you may want to use MacMapMaker, available from . It produces both NCSA and CERN-compatible maps, which can also be used with MacImagemap and a Macintosh-based server (MacImagemap is found in the same directory). There is another package available, called WebMap; however, the only FTP address I have for it points to an expired copy.

Tkmapedit

For Unix systems and other systems on which the Tk/Tcl language toolkit has been installed, Tkmapedit provides a WYSIWYG imagemap editor which is capable of directly testing links if the tkWWW web browser is available. Available by anonymous FTP from the TCL archive on ftp.aud.alcatel.com.

glorglox

For Unix systems, glorglox is a unique imagemapping tool which allows color indexes in GIF images to be associated with URLs. It's easier to use this than to describe it (or pronounce it), so check out the glorglox home page (URL is ).

_________________________________________________________________
_World Wide Web FAQ_

HOW CAN I MAKE TRANSPARENT AND INTERLACED GIFS? AND WHAT ARE THEY?

Transparent GIFs are useful because they appear to blend in smoothly with the user's display, even if the user has set a background color that differs from the one the developer expected. They do this by assigning one color to be transparent -- if the web browser supports transparency, that color will be replaced by the browser's background color, whatever it may be.

Interlaced GIFs appear first with poor resolution and then improve in resolution until the entire image has arrived, as opposed to arriving linearly from the top row to the bottom row. This is great to get a quick idea of what the entire image will look like while waiting for the rest.
This doesn't do much for you if your web browser doesn't support progressive display as the image is downloaded, but non-progressive-display web browsers will still display interlaced GIFs once they have arrived in their entirety.

You can make transparent and interlaced GIFs through the web without running any utility software on your own system through the Visioneering image manipulation page (URL is ), which will access your image through the web and produce an enhanced version for you to save.

To create transparent and interlaced GIFs under Unix, check out David Koblas' giftool, a program which can manipulate those options and many more aspects of your GIF file.

For Windows PCs, try Lview Pro, version 1A or later, available by anonymous FTP from oak.oakland.edu in the directory SimTel/win3/graphics, as well as from many mirror sites.

Adobe Photoshop users will be interested in PhotoGIF, Boxtop Software's plug-in to add sophisticated GIF support to Photoshop. PhotoGIF can save transparent and interlaced GIFs, as well as optimizing GIF images in other ways.

You can also create transparent and interlaced GIFs using the widely available NETPBM tools (an enhanced version of the older pbmplus tools, which do _not_ support these options). The following Unix shell script, contributed by Shane Castle, can make any GIF image transparent if a recent version of the netpbm utilities has been installed:

    #!/bin/sh
    # transparize: make one color of a GIF transparent and interlace it.
    if [ $# -lt 2 ]
    then
        echo "Usage: transparize gifname color"
        echo "  gifname - name of GIF file"
        echo "  color   - color ID to make transparent"
        exit 1
    fi
    # Convert to a temporary file first so a failure leaves the original intact.
    giftoppm "$1" | ppmtogif -interlace -transparent "$2" > /tmp/$$.gif
    if [ $? -eq 0 ]
    then
        mv /tmp/$$.gif "$1"
    else
        rm -f /tmp/$$.gif
    fi

Make the script executable using the chmod command. Usage is as follows:

    transparize gifname color

In addition, there is a document explaining transparent GIFs available at the URL http://melmac.corp.harris.com/transparent_images.html .
You can fetch the program giftrans by anonymous ftp from ftp.rz.uni-karlsruhe.de at the path /pub/net/www/tools/giftrans.c. There is also a Perl script (URL is: ) which makes transparent GIFs.

There are also four utilities for the Macintosh: Transparency (URL is ), Graphic Converter (available from the "usual Macintosh FTP sites", such as mac.archive.umich.edu; see the Macintosh newsgroups for general information on where to retrieve Macintosh software), Imagery (again, available from many Macintosh FTP sites), and clip2gif (available by anonymous FTP from orathost.cfa.ilstu.edu in the directory /public/oratClasses/ART389.88Seminar/software ).

A unique approach to the problem is offered by Imagizer (URL is ), which transforms your images on the fly when sending them to the user, supporting thumbnails and TIFF-GIF conversion as well as interlacing. (Of course, there is a tradeoff between storage space and CPU usage.)

_________________________________________________________________
_World Wide Web FAQ_

WHICH FORMAT IS BETTER FOR WWW IMAGE PURPOSES, JPEG OR GIF?

JPEG does a better job with realistic images such as scanned photographs. Netscape can handle JPEGs, and support has arrived in many Mosaic flavors as well. To allow those with other browsers to view your JPEGs, wrap a "normal" link to the image around the inline image so the user can download the image for display in an external viewer.

GIF does a better job with crisp, sharp images, such as those typically used to construct buttons, graphs and the like. All browsers that can display graphics at all can display GIFs inline.

_________________________________________________________________
_World Wide Web FAQ_

CAN I BUY SPACE ON AN EXISTING SERVER?

Yes, you can. A list of sites offering WWW space for lease is available (at the URL http://union.ncsa.uiuc.edu/HyperNews/get/www/leasing.html ).
_________________________________________________________________
_World Wide Web FAQ_

HOW DO I MAKE A "LINK" THAT DOESN'T LOAD A NEW PAGE?

Such links are useful when a form is intended to perform some action on the server machine without sending new information to the client, or when a user has clicked in an undefined area in an image map; these are just two possibilities.

A CGI script (see the CGI section) can accomplish this by outputting just the following:

    Status: 204 No Content

Followed by two line feeds (ascii 10 decimal). The web browser will take no action.

_________________________________________________________________
_World Wide Web FAQ_

HOW CAN I MIRROR PART OF ANOTHER SERVER?

Scripts are available to do this, but at this time they are not very friendly to the server you are attempting to mirror; their behavior resembles that of the more poorly written WWW robots. If you are trying to improve access times to a distant server, you will likely find the "proxy" capabilities of CERN's WWW server to be a more effective and general solution to your problem.

_________________________________________________________________
_World Wide Web FAQ_

DO MAILTO: URLS WORK IN ALL BROWSERS?

The mailto: URL is a feature found in Lynx, Netscape, Spry Mosaic, the latest NCSA Mosaics, Emacs w3 mode and many other browsers. In general, about 80% of web browsers support mailto: at the time of this writing. However, it is missing from numerous older browsers. It is of course also possible to set up forms which send mail to you; see the entry regarding email forms.

_________________________________________________________________
_World Wide Web FAQ_

HOW CAN I SERVE [WORD DOCUMENTS, EXCEL SPREADSHEETS, DOUGHNUTS]?

In order to deliver documents of new and different types from your server, you need to configure the correct "MIME type" for each type of document, and use the proper extension when naming the file on the server.
If the document type is highly unusual, you will also need to see to it that users know what MIME type to configure their browsers for, and what application to launch for that MIME type. More information on this subject is available in Ken Jenks' file format recommendations for web servers, . _________________________________________________________________ _World Wide Web FAQ_ HOW DO I PUBLICIZE MY WORK? There are several things you can do to publicize your new HTML server or other offering: * Post to comp.infosystems.www.announce. PLEASE READ THE CHARTER POSTING FIRST. In general, always read a newsgroup first to familiarize yourself before posting to it. * Submit it to Yahoo (URL is ), an impressive index of the web which expands its knowledge automatically but permits the direct submission of URLs as well. * Submit it to the NCSA What's New Page at the URL http://www.ncsa.uiuc.edu/SDG/Software/Mosaic/Docs/whats-new.html (see the page for details on how to submit your listing!). * Register your URL in the Lycos Database (URL is ). * Submit your URL to the maintainers of various catalogs, such as the WWW Virtual Library (at the URL http://www.w3.org/hypertext/DataSources/bySubject/Overview.html ) and the ALIWEB index (at the URL http://web.nexor.co.uk/aliweb/doc/aliweb.html ). * Read Gareth Rees' guide to publishing on the World Wide Web. (URL is http://www.cl.cam.ac.uk/users/gdr11/publish.html ). * Consult Pete Page's How to Announce your New Web Site (URL is ). _________________________________________________________________ _World Wide Web FAQ_ HEY, I KNOW, I'LL WRITE A WWW-EXPLORING ROBOT! WHY NOT? Programs that automatically traverse the web can be quite useful, but have the potential to make a serious mess of things. Robots have been written which do a "breadth-first" search of the web, exploring many sites in a gradual fashion instead of aggressively "rooting out" the pages of one site at a time. 
Some of these robots now produce excellent indexes of information available on the web. But others have written simple depth-first searches which, at the worst, can bring servers to their knees in minutes by recursively downloading information from CGI script-based pages that contain an infinite number of possible links. (Often robots can't realize this!) Imagine what happens when a robot decides to "index" the CONTENTS of several hundred mpeg movies. Shudder. The moral: a robot that does what you want may already exist; if it doesn't, please study the document World Wide Web Robots, Wanderers and Spiders (URL is: http://web.nexor.co.uk/mak/doc/robots/robots.html ) and learn about the emerging standards for exclusion of robots from areas in which they are not wanted. You can also read about existing robots there. _________________________________________________________________ _World Wide Web FAQ_ HOW CAN I PUT AN ACCESS COUNTER ON MY HOME PAGE? First of all, don't. It defeats caching proxy servers, putting more load on your server. It forces your server to run an external program for every page with a counter on it, putting more load on your server. And it advertises your demographics or lack of them to the world. _"Yeah, but I want to know how many people are accessing my page."_ Of course you do. Use one of the many statistics tools available to analyze the access log of your web server. Even if you are not the webmaster of your server, your admin will probably give you read-only access to the log files. _"I want an access counter anyway."_ In that case, consider the index of access counter software at Yahoo (URL is ). Keep in mind that you must have CGI access at a minimum, and server-side includes must also be turned on unless you are willing to build your entire page with CGI or use a program that generates the access count as an inline image. None of the above approaches are efficient. 
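For readers who take the log-analysis advice, here is a rough Python sketch of what a statistics tool does at its core. It is not the method of any particular tool. It assumes the server writes its access log in the Common Log Format and counts only requests answered with status 200. The name count_hits and the sample path in the usage note are hypothetical.

```python
import re

# Assumed Common Log Format:
#   host ident user [date] "METHOD path protocol" status bytes
REQUEST_RE = re.compile(r'"(?:GET|HEAD|POST) (\S+)[^"]*" (\d{3}) ')


def count_hits(log_lines, path):
    """Count successful (status 200) requests for `path` in the log lines."""
    hits = 0
    for line in log_lines:
        match = REQUEST_RE.search(line)
        if match and match.group(1) == path and match.group(2) == "200":
            hits += 1
    return hits
```

Called as count_hits(open("access_log"), "/~me/index.html"), this reads the log once and puts no extra load on the server at request time. Requests answered from a browser or proxy cache never reach the log, and responses with other status codes (such as 304) are deliberately skipped, so the result is a lower bound rather than an exact count.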
_________________________________________________________________
_World Wide Web FAQ_

ARE THERE BOOKS ABOUT THE WEB?

Yes, quite a few. A brief list follows. _New entries are solicited._ Please include ISBNs and/or ordering information.

HTML Web Publisher's Construction Kit

From Waite Group Press. $36.95. 700 pages. By David Fox and Troy Downing. Covers the proposed HTML 3.0 standard, CGI programming, server setup, and browser and editor issues. CD included. For more information, see .

The Web Server Book

From Ventana Press. $49.95. 650 pages. By Jonathan Magid, R. Douglas Matthews, and Paul Jones (the sunsite.unc.edu team). A guide to creating a Web server under Unix, including server software, security, HTML, conversion, verification, graphics and multimedia, searching and indexing, forms, CGI, and next-generation developments. Includes CD-ROM with Linux, Netscape, and source and binaries for common UNIX platforms to all the tools discussed. Sample chapter, updates, and special order price available at . ISBN: 1-56604-234-8.

World Wide Web: Beneath the Surf

From UCL Press. By Mark Handley and Jon Crowcroft. A look at the technologies that underlie the World Wide Web. The authors have taken the unusual step of making the entire book available online . ISBN: 1-85728-435-6.

HTML Pocket Reference Card

From Specialized Systems Consultants, Inc. 16-page reference foldout. $4.50. Covers HTML, URLs and related topics. ISBN 0-916151-79-4.

How to Publish on the Internet

From Warner Books. 275 pages. By Andrew Fry and David Paul. $17.95. Includes SPRY Mosaic software. A guide to publishing on the Web: HTML, graphics, style, strategies for maximizing audience by building information communities. Online "Next Chapter" includes an excerpt, New Tools review, links to Web publishing resources (URL is ). ISBN 0-446-67179-7.

Running a Perfect Web Site

From Que. 457 pages. By David M. Chandler.
A complete guide to setting up a Web server, including hardware/communications issues, HTML, forms, CGI scripts, and server-side includes. Includes a CD containing Windows HTTPD, NCSA httpd for Unix, HTML authoring tools, and dozens of SLIP utilities. ISBN 0-7897-0210-X. Read more about it at . Using Netscape From QUE. 350 pages. A user's guide to Netscape, including information on how to search the web and a disk containing Netscape itself. By Warren Ernst. ISBN: 0-7897-0211-8. $19.99 US. "Your Internet discount price: $15.99." The HTML Sourcebook From John Wiley and Sons. 411 pages. By Ian S. Graham. Contains a detailed description of HTML 2.0, including allowed element nestings, as well as descriptions of many HTML 3.0 features. There are also chapters on CGI and the HTTP protocol, complete with examples, and chapters discussing browsers, editors, servers and archive sites containing useful CGI programs and Web development tools. A review can be found at . ISBN: 0-471-11849-4. HTML Reference Card From SSC. 16-panel reference card. Covers basic and advanced HTML tags. $4.50. ISBN: 0-916151-79-4. The World Wide Web Handbook From International Thomson Computer Press. 350 pages. Covers getting connected to the web, designing HTML pages, and establishing a web server. Includes additional material on SGML and information regarding HTML 3.0. GBP 26.50. ISBN: 1-850-32205-8. The Mosaic Handbook (Mac, Windows and X editions) From O'Reilly. A short, sweet guide to the World Wide Web from a Mosaic user's perspective. Mac and Windows versions Include Enhanced NCSA Mosaic on floppy disk; the X Window System version includes NCSA Mosaic on CD-ROM. Telnet or gopher to gopher.ora.com (log in as gopher) or find details on the web ( Mac: , Windows: , and X Window System ). Wherever books with arcane fauna on the cover are sold. The World Wide Web Unleashed From Sams Publishing. By John December and Neil Randall. 
Additional chapters contributed by others; I wrote the chapter on HTML editors and filters. Covers both user and provider issues in detail. Supporting pages available on the web (URL is http://www.rpi.edu/~decemj/works/wwwu.html ). 1057 pages. ISBN: 0-672-30617-4. Call 1-800-428-5331 or +1-317-581-3500 for ordering information. Spinning the Web: How to Provide Information on the Internet From Van Nostrand Reinhold. By Andrew Ford. Oriented toward those with an interest in putting their data on the web. ISBN: 1-850-32141-8 (New York), 0-442-01962-9 (London). Available in December 1994. Teach Yourself Web Publishing with HTML in a Week From Sams Publishing. By Laura Lemay. Also oriented toward those who plan to publish materials on the web. ISBN: 0-672-30667-0. 400 pages. Includes information on setting up servers and handling forms results as well as HTML writing and editing. (URL is: http://slack.lne.com/lemay/theBook/index.html ) Available December 22nd, 1994. Call 1-800-428-5331 or +1-317-581-3500 for ordering information. The HTML Manual of Style From Ziff-Davis Press. By Larry Aronson. Chapters: introduction to the WWW, the HTML language, writing HTML documents, and HTML examples. 120 pages. Available in December 1994. The Internet via Mosaic and World-Wide Web From Ziff-Davis Press. By Steve Browne. Details on obtaining Mosaic and Trumpet Winsock, getting it all set up, and what to do with it once it works. A chapter of interesting sites on the Web as well. ISBN: 1-56276-259-1. MOSAIC Quick Tour From Ventana Press. By Gareth Branwyn. A good guide to installing and using NCSA Mosaic under Windows. Includes basic HTML and trouble-shooting chapters. "More hand-holding than the FAQ and gives lots of details." - Mari J. Stoddard Managing Internet Information Services From O'Reilly and Associates. By Cricket Liu, Jerry Peek, Russ Jones, Bryan Buus & Adrian Nye. 
A good choice for those who will be installing and maintaining WWW servers; also includes documentation on HTML, imagemaps and the like. Also covers other types of Internet services. See for more information. Hands-On Mosaic: A Guide for Window Users From Prentice Hall. By Dr. David Sachs & Henry Stair. ISBN: 0-13-172321-9. HTML Authoring for Fun & Profit From Prentice Hall. By Mary Morris. Jan 1995. ISBN: 0-13-359290-1. NCSA Mosaic Handbook From Prentice Hall. By Amy K. Kreiling & Frank Baker. Jan 1995. ISBN: 0-13-196692-8. Plug-n-Play Mosaic for Windows From Sams. By Angela Gunn. ISBN 0-672-30627-1. 300 pages. Disks include a special version of Enhanced NCSA Mosaic for Windows with built-in TCP/IP Winsock and dialer, and an automated configuration program (hence "plug-n-play"). The book is an introduction to Mosaic and the Web with some coverage of creating a home page and HTML and, of course, the obligatory directory of Web sites. Using Mosaic From Que. Ed. by Que Development Group. ISBN: 0-7897-0021-2. Covers NCSA Mosaic for Windows and the Macintosh. Using the World Wide Web From Que. Ed. by Que Development Group. ISBN: 0-7897-0016-6. Mosaic User's Guide From MIS Press. By Bryan Pfaffenberger. ISBN: 1-55828-409-5. Using Mosaic for Windows From Electric Avenue Press. By Stephen Gauer. ISBN: 0-969-8853-0-X. _________________________________________________________________ _World Wide Web FAQ_ WHAT MAILING LISTS DISCUSS THE WEB? There are many mailing lists about the web, and they come and go rather quickly. Please see the W3 Consortium mailing lists page and the W3 Consortium's list of other known mailing lists about the web for more information. _________________________________________________________________ _World Wide Web FAQ_ WHAT NEWSGROUPS DISCUSS THE WEB? You can find information about World Wide Web topics in fifteen distinct newsgroups. They are subdivided for good reasons; use the ONE newsgroup most relevant to your topic, please. 
Note that two searchable archives of the www newsgroups are available. * American Web Services * Critical Mass Communications * Authoring-Related Groups comp.infosystems.www.authoring.cgi This newsgroup covers discussion of the development of Common Gateway Interface (CGI) scripts as they relate to Web page authoring. Possible subjects include discussion how to handle the results of forms, how to generate images on the fly, and how to put together other interactive Web offerings. comp.infosystems.www.authoring.html This newsgroup covers discussion of HyperText Markup Language (HTML) as it relates to web page authoring. Possible subjects include HTML editors, formatting tricks, and current and proposed HTML standards. comp.infosystems.www.authoring.images This newsgroup covers discussion of the creation and editing of images as they relate to web page authoring. Possible subjects include how best to leverage the image-display capabilities of the web and common questions and solutions for putting up imagemaps. comp.infosystems.www.authoring.misc This newsgroup covers miscellaneous World-Wide Web authoring issues not covered by the other c.i.w.authoring.* groups. Possible subjects include the use of audio and video, etc. * Browser software -- related groups comp.infosystems.www.browsers.mac This newsgroup covers discussion of World-Wide Web browsers for the Macintosh platform. Possible subjects include configuration questions/solutions, external viewers (helper applications), and bug reports. comp.infosystems.www.browsers.ms-windows This newsgroup covers discussion of World-Wide Web browsers for the MS Windows and NT platforms. Possible subjects include configuration questions/solutions, external viewers (helper applications), and bug reports. comp.infosystems.www.browsers.x This newsgroup covers discussion of World-Wide Web browsers for the X-Window system. 
Possible subjects include configuration questions/solutions, external viewers (helper applications), and bug reports. comp.infosystems.www.browsers.misc This newsgroup covers discussion of World-Wide Web browsers for all other platforms. Possible subjects include configuration questions/solutions, external viewers (helper applications), and bug reports. Platforms included are Amiga, DOS (*not* Windows), VMS, and Unix text-mode. * Web Server -- related groups comp.infosystems.www.servers.mac This newsgroup covers discussion of World-Wide Web servers for the Macintosh (MacOS) platform. Possible subjects include configuration questions/solutions, security issues, directory structure, and bug reports. comp.infosystems.www.servers.ms-windows This newsgroup covers discussion of World-Wide Web servers for the MS Windows and NT platforms. Possible subjects include configuration questions/solutions, security issues, directory structure, and bug reports. comp.infosystems.www.servers.unix This newsgroup covers discussion of World-Wide Web servers for Unix platforms. Possible subjects include configuration questions/solutions, security issues, directory structure, and bug reports. comp.infosystems.www.servers.misc This newsgroup covers discussion of World-Wide Web servers for other platforms, such as Amiga, VMS, and others. Possible subjects include configuration questions/solutions, security issues, directory structure, and bug reports. * Other Discussion comp.infosystems.www.advocacy This newsgroup is for comments, arguments, debates, and discussions about which Web browsers, servers, external viewer programs, and other software is better or worse than any other. Posts should not be crossposted to this group and to any other Web group. However, this group is a good place to direct follow-ups if a thread in another Web group begins to take on a "this program is better than that one" flavor. 
Possible subjects include: "The web is better than print"; "Netscape is better than anything else"; "CERN httpd kicks butt"; etc. comp.infosystems.www.misc comp.infosystems.www.misc (unmoderated) provides a forum for general discussion of WWW (World Wide Web)- related topics that are NOT covered by the other newsgroups in the hierarchy. This will likely include discussions of the Web's future, politicking regarding changes in the structure and protocols of the web that affect both clients and servers, et cetera. * Announcements comp.infosystems.www.announce A newsgroup in which new web-related resources can be announced. READ THE GROUP FIRST to find the posting guidelines. * Obsolete Newsgroups comp.infosystems.www.providers This will be removed in July. comp.infosystems.www.users This will be removed in July. comp.infosystems.www Removed approximately a year ago. If your site still carries this group, ask your admin to remove it. _________________________________________________________________ _World Wide Web FAQ_ CREDITS Maintainer (11/93 to present): Thomas Boutell, _boutell@netcom.com_ Former Maintainer (until 11/93): Nathan Torkington, _Nathan.Torkington@vuw.ac.nz_ _________________________________________________________________ _World Wide Web FAQ_