The Methabot Project

A fast, scriptable web crawler system
Writing /var/www/bithack.se/projects/methabot/data/cache/9/938b3cee3e8e5f0f141dae1e719ce810.i failed
Unable to save cache file. Hint: disk full; file permissions; safe_mode setting.
Writing /var/www/bithack.se/projects/methabot/data/cache/9/938b3cee3e8e5f0f141dae1e719ce810.xhtml failed

Differences

This shows you the differences between the selected revision and the current version of the page.

index 2009/06/23 16:32 index 2009/10/08 14:58 current
Line 1: Line 1:
====== What is Methabot? ====== ====== What is Methabot? ======
-**Methabot** is an open source web crawler and command line tool **optimized for speed**. It supports scripted filetype parsing, a wide variety of [[option_reference|customization options]] and is [[docs/configuration_file_basics|easily configured]] to fit anyones [[howto/index|particular needs]].+**Methabot** is an open source web crawler and command line tool **optimized for speed**. It supports scripted filetype parsing, a wide variety of customization options and is easily configured to fit anyones particular needs.
**WEBSITE MOVED**: This project has moved to a new website: http://metha-sys.org/ **WEBSITE MOVED**: This project has moved to a new website: http://metha-sys.org/
- 
-**Latest Release**: [[http://metha-sys.org/|Methanol/1.7.0]] 
===== Latest News ===== ===== Latest News =====
Line 12: Line 10:
Methabot is rich with fine features, some of them, but not all, are listed below. Methabot is rich with fine features, some of them, but not all, are listed below.
  * It's fast, designed from the ground and up with speed-optimization in mind.   * It's fast, designed from the ground and up with speed-optimization in mind.
-  * Scriptable through Javascript with [[E4X]]+  * Scriptable through Javascript with E4X
  * User-defined filetype filtering (according to MIME type, file extension or UMEX expression)   * User-defined filetype filtering (according to MIME type, file extension or UMEX expression)
  * Multi-threaded   * Multi-threaded
  * Highly configurable from command line   * Highly configurable from command line
  * Extensible module system, supporting custom data parsers, filters and protocol handlers.   * Extensible module system, supporting custom data parsers, filters and protocol handlers.
-  * MySQL support through the Javascript-MySQL binding ([[/docs/modules/lmm_mysql/|lmm_mysql]]). +  * MySQL support through the Javascript-MySQL binding (lmm_mysql). 
-  * Simple yet powerful filtering of URLs through [[UMEX]].+  * Simple yet powerful filtering of URLs through UMEX.
  * Automated downloading   * Automated downloading
  * Support for automatic cookie handling when running over HTTP   * Support for automatic cookie handling when running over HTTP
-  * [[docs/robotstxt|Robots Exclusion Standard]]+  * Robots Exclusion Standard
  * Reliable, fault-tolerant networking, redirect-loop detection and some spider trap detection   * Reliable, fault-tolerant networking, redirect-loop detection and some spider trap detection
  * Parser chaining, share data easily between C and javascript parsers   * Parser chaining, share data easily between C and javascript parsers
Line 31: Line 29:
=====Further Project Information===== =====Further Project Information=====
-  * [[License]] +http://metha-sys.org/ 
-  * [[developers|Developer Team]] +
-  * [[support|Contact]] +
-  * [[http://sourceforge.net/projects/methabot/|Sourceforge Project Page]]+
 
 
index.1245767569.txt.gz · Last modified: 2009/06/23 16:32 by sdac