4. WWW copies

Instead of printed brochures, many institutions publish their information on the World Wide Web (WWW), so we decided to include copies of their web pages. Although the restrictions described below apply, we believe the information copied onto this CD is useful and sufficient.

The web pages were retrieved recursively, down to a depth of five levels. For example, if the main page of the institute were http://www.kokken.go.jp/, then something like http://www.kokken.go.jp/one/two/three/four/five.html would be fetched, while http://www.kokken.go.jp/one/two/three/four/five/six.jpg and http://www.kokken.go.jp/one/two/three/four/five/six.html would simply be ignored.
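The depth rule in the example above can be sketched in a few lines of Python. This is only an illustration, not the tool that was actually used to make the copies: the function names and the MAX_DEPTH constant are hypothetical, and the sketch assumes that "depth" means the number of path segments below the site root, as the example URLs suggest. A real mirroring tool may instead count link hops from the main page.

```python
from urllib.parse import urlparse

# Depth limit stated in the text; the constant name is an assumption.
MAX_DEPTH = 5


def path_depth(url: str) -> int:
    """Count the non-empty path segments below the site root."""
    path = urlparse(url).path
    return len([segment for segment in path.split("/") if segment])


def would_be_fetched(url: str, max_depth: int = MAX_DEPTH) -> bool:
    """Return True if the URL lies within the depth limit (path-based reading)."""
    return path_depth(url) <= max_depth


if __name__ == "__main__":
    for url in (
        "http://www.kokken.go.jp/",
        "http://www.kokken.go.jp/one/two/three/four/five.html",
        "http://www.kokken.go.jp/one/two/three/four/five/six.html",
    ):
        print(path_depth(url), would_be_fetched(url), url)
```

Under this reading the main page has depth 0, five.html has depth 5 and is kept, and six.html and six.jpg have depth 6 and are skipped.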

This CD was made in the Joliet extension format, which may be incompatible with some operating systems. The Joliet extension also has a restriction of its own: filenames cannot contain more than 64 characters. http://www.kokken.go.jp/eenie_meenie_minie_moe/catch_a_tiger_by_its_toe.html, for example, has 69 characters (excluding the "http://" prefix). Because the WWW copies are placed in the "mirror" directory, which takes up 7 of those 64 characters (the directory name plus a separator), URLs containing more than 57 characters will in practice be corrupted.
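The arithmetic behind the 57-character figure can be checked with a short Python sketch. The names used here (JOLIET_LIMIT, MIRROR_PREFIX, url_letters, fits_on_cd) are hypothetical, and the sketch assumes that the "mirror" directory contributes "mirror" plus one separator, which is what the difference between 64 and 57 implies.

```python
# Limit stated in the text; the constant names are assumptions.
JOLIET_LIMIT = 64
MIRROR_PREFIX = "mirror/"  # assumed: directory name plus one separator (7 characters)


def url_letters(url: str) -> str:
    """Return the URL without its scheme prefix such as "http://"."""
    return url.split("://", 1)[-1]


def max_url_length() -> int:
    """Longest URL (without scheme) that still fits under the limit."""
    return JOLIET_LIMIT - len(MIRROR_PREFIX)


def fits_on_cd(url: str) -> bool:
    """Return True if the mirrored name stays within the 64-character limit."""
    return len(url_letters(url)) <= max_url_length()


if __name__ == "__main__":
    example = (
        "http://www.kokken.go.jp/eenie_meenie_minie_moe/"
        "catch_a_tiger_by_its_toe.html"
    )
    print(len(url_letters(example)))  # length without the "http://" prefix
    print(max_url_length())
    print(fits_on_cd(example))
```

For the example URL this reports a length of 69 against an effective limit of 57, so its copy on the CD would be one of the corrupted ones.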


Readme for Research on the World's Language Research Institutes: WWW copies