The Methabot Project

A fast, scriptable web crawler system
Writing /var/www/bithack.se/projects/methabot/data/cache/9/938b3cee3e8e5f0f141dae1e719ce810.i failed
Unable to save cache file. Hint: disk full; file permissions; safe_mode setting.
Writing /var/www/bithack.se/projects/methabot/data/cache/9/938b3cee3e8e5f0f141dae1e719ce810.xhtml failed

Differences

This shows you the differences between the selected revision and the current version of the page.

howto:jail_methabot_in_a_specific_directory 2009/02/24 00:34 howto:jail_methabot_in_a_specific_directory 2009/02/24 00:35 current
Line 2: Line 2:
//Support for the %%--jail%% option was added in **libmetha/1.6.0**// //Support for the %%--jail%% option was added in **libmetha/1.6.0**//
-The %%--jail%% or %%-j%% option lets you jail methabot in a specific directory. Let's say you are extracting image files from a website, but you don't want Methabot to go up a directory to other content. This is when jailing is useful. Consider the URL http://example.com/images/, to jail Methabot in the /images/ directory, run:+The %%--jail%% or %%-j%% option lets you jail methabot in a specific directory. Let's say you are extracting image files from a website, but you don't want Methabot to go up a directory to other content. This is when jailing is useful. Consider the URL %%http://example.com/images/%%, to jail Methabot in the /images/ directory, run:
<code> <code>
$ mb :images -j example.com/images/ $ mb :images -j example.com/images/
</code> </code>
-Note that -j does not take an argument. Methabot will be jailed in the directory of the first URL, you could for example run it on http://example.com/images/page2.html and Methabot would still be jailed in /images/ because page2.html is in that directory.+Note that -j does not take an argument. Methabot will be jailed in the directory of the first URL, you could for example run it on %%http://example.com/images/page2.html%% and Methabot would still be jailed in /images/ because page2.html is in that directory.
 
 
howto/jail_methabot_in_a_specific_directory.txt · Last modified: 2009/02/24 00:35 by sdac