<?xml version="1.0"?>
<feed xmlns="http://www.w3.org/2005/Atom" xml:lang="en">
	<id>https://wiki.encyclosphere.org/index.php?action=history&amp;feed=atom&amp;title=Database</id>
	<title>Database - Revision history</title>
	<link rel="self" type="application/atom+xml" href="https://wiki.encyclosphere.org/index.php?action=history&amp;feed=atom&amp;title=Database"/>
	<link rel="alternate" type="text/html" href="https://wiki.encyclosphere.org/index.php?title=Database&amp;action=history"/>
	<updated>2026-04-22T15:06:09Z</updated>
	<subtitle>Revision history for this page on the wiki</subtitle>
	<generator>MediaWiki 1.41.0</generator>
	<entry>
		<id>https://wiki.encyclosphere.org/index.php?title=Database&amp;diff=488&amp;oldid=prev</id>
		<title>Hampson: Created page with &quot;A database is a collection of ZWI files, usually hosted by an aggregator. More documentation can be found at [https://docs.encyclosphere.org/#/databases this link].  == Structure == Here's what the file tree of a typical ZWI database looks like:  &lt;code&gt;database/  └── '''en &lt;————'''      ├── examplepedia      │   ├── en.examplepedia.org      │   │   └── wiki#Example_article.zwi      │   └── en.examplepedia.org....&quot;</title>
		<link rel="alternate" type="text/html" href="https://wiki.encyclosphere.org/index.php?title=Database&amp;diff=488&amp;oldid=prev"/>
		<updated>2024-02-21T21:26:00Z</updated>

		<summary type="html">&lt;p&gt;Created page with &amp;quot;A database is a collection of &lt;a href=&quot;/ZWI&quot; title=&quot;ZWI&quot;&gt;ZWI&lt;/a&gt; files, usually hosted by an &lt;a href=&quot;/Aggregator&quot; title=&quot;Aggregator&quot;&gt;aggregator&lt;/a&gt;. More documentation can be found at [https://docs.encyclosphere.org/#/databases this link].  == Structure == Here&amp;#039;s what the file tree of a typical ZWI database looks like:  &amp;lt;code&amp;gt;database/  └── &amp;#039;&amp;#039;&amp;#039;en &amp;lt;————&amp;#039;&amp;#039;&amp;#039;      ├── examplepedia      │   ├── en.examplepedia.org      │   │   └── wiki#Example_article.zwi      │   └── en.examplepedia.org....&amp;quot;&lt;/p&gt;
&lt;p&gt;&lt;b&gt;New page&lt;/b&gt;&lt;/p&gt;&lt;div&gt;A database is a collection of [[ZWI]] files, usually hosted by an [[aggregator]]. More documentation can be found at [https://docs.encyclosphere.org/#/databases this link].&lt;br /&gt;
&lt;br /&gt;
== Structure ==&lt;br /&gt;
Here's what the file tree of a typical ZWI database looks like:&lt;br /&gt;
 &amp;lt;code&amp;gt;database/&lt;br /&gt;
 └── '''en &amp;lt;————'''&lt;br /&gt;
     ├── examplepedia&lt;br /&gt;
     │   ├── en.examplepedia.org&lt;br /&gt;
     │   │   └── wiki#Example_article.zwi&lt;br /&gt;
     │   └── en.examplepedia.org.csv.gz&lt;br /&gt;
     ├── trash&lt;br /&gt;
     │   └── examplepedia&lt;br /&gt;
     │       ├── en.examplepedia.org&lt;br /&gt;
     │       │   └── wiki#Deleted_article.zwi&lt;br /&gt;
     │       └── en.examplepedia.org.csv.gz&lt;br /&gt;
     └── index.csv.gz&amp;lt;/code&amp;gt;&lt;br /&gt;
There's an &amp;lt;code&amp;gt;en&amp;lt;/code&amp;gt; folder in the top level of this database. &amp;lt;code&amp;gt;en&amp;lt;/code&amp;gt; is the 2-letter ISO language code for English. (Here's a handy list of language codes.) In the top level of the database, there is a folder for each 2-letter language code, containing all of the articles in the corresponding language.&lt;br /&gt;
 &amp;lt;code&amp;gt;database/&lt;br /&gt;
 └── en&lt;br /&gt;
     ├── '''examplepedia &amp;lt;————'''&lt;br /&gt;
     │   ├── en.examplepedia.org&lt;br /&gt;
     │   │   └── wiki#Example_article.zwi&lt;br /&gt;
     │   └── en.examplepedia.org.csv.gz&lt;br /&gt;
     ├── trash&lt;br /&gt;
     │   └── examplepedia&lt;br /&gt;
     │       ├── en.examplepedia.org&lt;br /&gt;
     │       │   └── wiki#Deleted_article.zwi&lt;br /&gt;
     │       └── en.examplepedia.org.csv.gz&lt;br /&gt;
     └── index.csv.gz&amp;lt;/code&amp;gt;&lt;br /&gt;
In each language folder (e.g. &amp;lt;code&amp;gt;en&amp;lt;/code&amp;gt;), there is a folder for each publisher, containing the articles from that publisher.&lt;br /&gt;
 &amp;lt;code&amp;gt;database/&lt;br /&gt;
 └── en&lt;br /&gt;
     ├── examplepedia&lt;br /&gt;
     │   ├── en.examplepedia.org&lt;br /&gt;
     │   │   └── wiki#Example_article.zwi&lt;br /&gt;
     │   └── en.examplepedia.org.csv.gz&lt;br /&gt;
     ├── '''trash &amp;lt;————'''&lt;br /&gt;
     │   └── examplepedia&lt;br /&gt;
     │       ├── en.examplepedia.org&lt;br /&gt;
     │       │   └── wiki#Deleted_article.zwi&lt;br /&gt;
     │       └── en.examplepedia.org.csv.gz&lt;br /&gt;
     └── index.csv.gz&amp;lt;/code&amp;gt;&lt;br /&gt;
Also in the language folder, there's a special &amp;lt;code&amp;gt;trash&amp;lt;/code&amp;gt; folder, containing ZWI files marked for deletion. The &amp;lt;code&amp;gt;trash&amp;lt;/code&amp;gt; folder has the same structure as the parent language folder.&lt;br /&gt;
 &amp;lt;code&amp;gt;database/&lt;br /&gt;
 └── en&lt;br /&gt;
     ├── examplepedia&lt;br /&gt;
     │   ├── '''en.examplepedia.org &amp;lt;————'''&lt;br /&gt;
     │   │   └── wiki#Example_article.zwi&lt;br /&gt;
     │   └── en.examplepedia.org.csv.gz&lt;br /&gt;
     ├── trash&lt;br /&gt;
     │   └── examplepedia&lt;br /&gt;
     │       ├── en.examplepedia.org&lt;br /&gt;
     │       │   └── wiki#Deleted_article.zwi&lt;br /&gt;
     │       └── en.examplepedia.org.csv.gz&lt;br /&gt;
     └── index.csv.gz&amp;lt;/code&amp;gt;&lt;br /&gt;
In each publisher folder (e.g. &amp;lt;code&amp;gt;examplepedia&amp;lt;/code&amp;gt;), there is a folder for each domain used by the publisher, containing the articles hosted on that domain. In this example, the only domain is &amp;lt;code&amp;gt;en.examplepedia.org&amp;lt;/code&amp;gt;.&lt;br /&gt;
 &amp;lt;code&amp;gt;database/&lt;br /&gt;
 └── en&lt;br /&gt;
     ├── examplepedia&lt;br /&gt;
     │   ├── en.examplepedia.org&lt;br /&gt;
     │   │   └── '''wiki#Example_article.zwi &amp;lt;————'''&lt;br /&gt;
     │   └── en.examplepedia.org.csv.gz&lt;br /&gt;
     ├── trash&lt;br /&gt;
     │   └── examplepedia&lt;br /&gt;
     │       ├── en.examplepedia.org&lt;br /&gt;
     │       │   └── wiki#Deleted_article.zwi&lt;br /&gt;
     │       └── en.examplepedia.org.csv.gz&lt;br /&gt;
     └── index.csv.gz&amp;lt;/code&amp;gt;&lt;br /&gt;
In the domain folders (e.g. &amp;lt;code&amp;gt;en.examplepedia.org&amp;lt;/code&amp;gt;) are the ZWI files themselves.&lt;br /&gt;
&lt;br /&gt;
The names of the ZWI files start with the part of the &amp;lt;code&amp;gt;SourceURL&amp;lt;/code&amp;gt; after the domain, with the slashes replaced by pound signs (&amp;lt;code&amp;gt;#&amp;lt;/code&amp;gt;). (&amp;lt;code&amp;gt;SourceURL&amp;lt;/code&amp;gt; is a field in the ZWI file's &amp;lt;code&amp;gt;metadata.json&amp;lt;/code&amp;gt;.)&lt;br /&gt;
&lt;br /&gt;
Any existing pound signs in the URL are escaped with another pound sign: &amp;lt;code&amp;gt;##&amp;lt;/code&amp;gt;&lt;br /&gt;
&lt;br /&gt;
Finally, &amp;lt;code&amp;gt;.zwi&amp;lt;/code&amp;gt; is added.&lt;/div&gt;</summary>
		<author><name>Hampson</name></author>
	</entry>
</feed>