Category Archives: Sinosphere

End of the Cantonese Goldlisting Project

In March 2015, I posted about my Cantonese experiment, where I used a list of characters and a dictionary to goldlist Cantonese. At first, I thought it was just going to be a slightly inefficient way of going about it, but there were several unforeseen problems that made the experiment a major failure. Now that the project is in the closing stages and I have to wait before I can keep distilling, I wanted to write about what I learned. To be frank, it was more of a waste of time and effort than expected, and I would not do it again.

The whole project has spanned five years—seven if I count the work I did before the actual experiment. While I haven’t actually spent five years on it, this project has caused me to lose more time than what I actually spent working on it. For most of the last five years, I haven’t done much goldlisting. This was partly because of work, but also because all the problems that became apparent really demotivated me. That was probably the worst outcome of the entire project, because I could have finished many shorter projects instead of wasting time on one big one.

Length of the Project

I originally thought it would only take me about 20,000 headlist lines, but I didn’t stop until I reached 40,000. There was still no end in sight, and it was apparent that I had included hundreds (if not thousands) of words I shouldn’t have. There were also many duplicates, because it was hard to keep track of which characters I had already covered.

Lack of Context

I knew this before I started, but I didn’t think lack of context would be such a big problem. I had done small-scale projects like this before, and those worked out well. I sometimes misunderstood what a word meant or how to use it, but then I’d encounter it in the wild and realize the mistake, and although I had no context, the words I was learning were all very basic, so I could always imagine a context and be fairly sure it was accurate. This time, the words ranged from very basic all the way to needlessly advanced, and I had no idea which were which, because although CantoDict does divide them up into five categories, almost all of the words are put in the middle category, which kind of defeats the purpose.

Source Material

The biggest problem turned out to be my choice of source material. CantoDict is maintained by volunteers who work when they can and want to. Nobody is responsible for checking every single entry to ensure accuracy, so there are many inaccurate entries. I suspect these were added and parsed automatically, because they often lack the changed tones and are listed with their citation tones instead, and others use the wrong reading of one of the characters.

For this reason, it is useful as a reference or second opinion, but should not be relied upon alone (but be warned: many online dictionaries get their Cantonese data directly from CantoDict). Fortunately, the editors do a good job linking to discussions about each entry when they come up on the forums, in which case you do get a second opinion right there.

Early on, I did not realize just how many wrong entries there would be. I used the forums to ask when I suspected something was off, but I didn’t always get a reply there either, so I soon resorted to asking friends instead. Eventually, I got pretty good at spotting suspicious words to ask about, and this revealed another problem, namely that many of the words and expressions were completely unknown to my friends! Often, I’d ask how to pronounce something, and they’d ask me what it was supposed to mean.

The sheer amount of such words was the main reason I not only stopped adding to the headlist, but also simply gave up and crossed out hundreds of the words I had left. I crossed out anything that seemed suspicious, although I’d sometimes ask someone to make sure. I’m sure I also crossed out many legitimate words by doing it this way, but it was better than wasting more time learning the useless ones.

Final Stages

The last stages of the project involved filling up the last bronze book and distilling all the words I have left. The reason for this is that the CantoDict headlist only reached 38725 lines, so I had space left over. I filled it up with Teach Yourself Cantonese, Colloquial Cantonese, and Intermediate Cantonese. I wish I had started with these instead of saving them for last, but they also came with their own problems, which I may write about later.

After cutting out all the words I didn’t trust, there wasn’t that much left to distill, but I did it anyway. Now, I’m all caught up and waiting to be able to keep distilling. I estimate that I’ll finish the entire project in less than two months from now and have 16 bronze books, 3 silver books, and 1 gold book to show for it.


This experiment was really a failure. It was meant to be a somewhat inefficient way to achieve a very useful goal, but it turned out not to be. Even if I had used a more efficient method with the same source material, I would just have failed faster (which would have been preferable, but now I know).

That’s not to say I got nothing out of it, though. I did learn thousands of useful words and expressions and I got really good at spotting suspicious information about Cantonese (which did come in handy as I was going through the textbooks at the end—Cantonese learning materials always come with lots of mistakes in them, unfortunately!). Still, it wasn’t worth the wasted time and effort in the end, and I would not do it this way again.

The Way Forward

Although the experiment is over, my Cantonese goldlisting doesn’t have to be. I only got halfway through Intermediate Cantonese before running out of pages in my bronze book. I could start a 17th bronze book and keep going, but I haven’t decided yet. For future projects, I’ll also choose dictionaries that at least have example sentences. I do happen to have one for Cantonese, namely 東方廣東語辭典 (a Cantonese–Japanese dictionary that is better than any Cantonese–English one I’ve seen), so I might goldlist that and see how it goes. It is unfortunately not without errors either, but it’s far more accurate than CantoDict.

著 and 着

Few characters are as quirky as 著. It seems to have been a variant that eventually split off from 箸, and now its own variant 着 is in the process of splitting off from 著. This illustrates a trend in the development of Chinese characters: they often start out as one sign that may represent several different (though usually similar-sounding) syllables and thus several meanings. Then, they split into more signs that share the burden of sounds and meanings associated with them. It’s rare for characters to merge, but there are many examples of such splits.


The Republic of China

It has been assigned 5 common readings in the national language of the Republic of China, or Standard Mandarin: ㄓㄨˋ、ㄓㄨㄛˊ、ㄓㄠˊ、ㄓㄠˉ、˙ㄓㄜ (zhù, zhuó, zháu, zhāu, zhe), as well as 2 that are rare enough to be ignored: ㄔㄨˊ、ㄓㄨˇ (chú, zhǔ). As far as the Ministry of Education of the Republic of China is concerned, 着 is a variant of 著, and that’s the whole story.

ㄓㄨㄛˊ is a merger of two Middle Chinese readings: one with a voiceless initial consonant, and one with a voiced one (cf. Cantonese zhoek³ and zhoek⁶). It is interesting to note that ㄓㄨㄛˊ and ㄓㄠˊ probably both developed from the reading with the voiced initial (one being literary and the other colloquial), but they ended up acquiring different meanings. ㄓㄠˉ is possibly a further development of the latter.

˙ㄓㄜ is a Mandarin-specific particle that had to be written down somehow, and this character did the job.

ㄔㄨˊ is only used in 著雍・著雝 (ㄔㄨˊ ㄩㄥˉ [chúyūng]), an alternate name for 戊, the fifth of the ten heavenly stems (十天干). ㄓㄨˇ is only used in 著任 (ㄓㄨˇ ㄖㄣˋ [zhǔrèn]), but I’m not sure what it means. Most dictionaries ignore these two, for obvious reasons.



In pre-war Japan, the situation was the same: 著 had two Sino-Japanese readings (since Japan never had the ㄓㄨㄛˊ–ㄓㄠˊ–ㄓㄠˉ split or a reading corresponding to the Mandarin particle): チョ and チャク (cho and chaku). Here from 詳解漢和字典, published just after WWII, identifying 着 as a vulgar variant of 著.



And here is the 著 entry in the same dictionary:

Screenshot at 2017-04-27 20:31:35 Screenshot at 2017-04-27 20:31:50 Screenshot at 2017-04-27 20:32:04 Screenshot at 2017-04-27 20:32:14


However, after WWII, 著 and 着 were assigned different roles: 著 for チョ, 着 for チャク. Native Japanese readings follow the meanings connected to the Sino-Japanese readings, so 著 for あらはꜜす and いちじるしꜜい (arawaꜜsu and ichijirushiꜜi) and 着 for きる and つꜜく・つくꜜ (kiru and tsuꜜku/tsukuꜜ).


Mainland China

When the Communist Party of China developed their standard, they assigned the two characters the same roles as the Japanese. 著 for ㄓㄨˋ, 着 for ㄓㄨㄛˊ、ㄓㄠˊ、˙ㄓㄜ and ㄓㄠˉ (this last one is also written 招, since they sound the same in Mandarin).



Korean usage is the same as well, suggesting this usage wasn’t just invented after the war. Korea never had any major character reforms, and still 著 is usually reserved for 저 (chŏː) and 着 is usually reserved for 착 (ch’ak). However, either can be used for either reading. In practice, though, most Koreans unfortunately don’t use characters at all anymore.



In Vietnam, characters are unfortunately used even less than in Korea, but dictionaries from the late 1800s suggest trứ and trước were both written 著, as in the Republic of China. Interestingly, there doesn’t seem to be a trược reading (corresponding to a Middle Chinese voiced initial consonant) for this character. The meanings that would be associated with that reading are listed under trước.

著 (trứ) in Bonet’s (1899) Vietnamese–French dictionary:Screenshot at 2017-04-25 21:37:54


And 著 (trước) in the same dictionary:

Screenshot at 2017-04-25 21:38:36


著 (trứ) in Génibrel’s (1898) Vietnamese–French dictionary:

Screenshot at 2017-04-25 21:35:31


And 著 (trước) in the same dictionary:

Screenshot at 2017-04-25 21:36:45 Screenshot at 2017-04-25 21:37:16


Modern dictionaries do include 着, and though they seem to prefer the reading trước for it, some also list trứ. 著 is always listed with both readings.


Hong Kong (and Macau)

Until recently, I was under the impression that Hong Kong usage, and presumably Macanese usage as well, differed from all of the above. CantoDict distinguishes them this way: 著 for zhy³ (ㄓㄨˋ) and zhoek³ (ㄓㄨㄛˊ), 着 for zhoek⁶ (ㄓㄨㄛˊ、ㄓㄠˊ、˙ㄓㄜ and ㄓㄠˉ).

著 in CantoDict (24.04.2017)

Screenshot at 2017-04-24 22:19:32


着 in CantoDict (24.04.2017)

Screenshot at 2017-04-24 22:20:07

However, my friend Kumono Shōta showed me the List of Graphemes of Commonly-used Chinese Characters, published by the Hong Kong Education Bureau. I was surprised to learn that people on the CantoDict forums seem to be wrong about the official Hong Kong division of 著 and 着. According to this list, 著 is for zhy³ (ㄓㄨˋ) and 着 is for zhoek³ (ㄓㄨㄛˊ、ㄓㄠˊ、˙ㄓㄜ and ㄓㄠˉ), just like in all the other jurisdictions (except for the Republic of China, of course).


著 in the List of Graphemes of Commonly-used Chinese Characters:


着 in the List of Graphemes of Commonly-used Chinese Characters:


Correspondance List

I made a list with the readings and the meanings the characters (roughly) correspond to!

Mandarin – Cantonese – Japanese – Korean – Vietnamese – English

ㄓㄨˋ – zhy³ – チョ – 저 – trứ – notoriety, authorship

ㄓㄨㄛˊ – zhoek³ – チャク – 착 – trước – to don

ㄓㄨㄛˊ – zhoek⁶ – チャク – 착 – trước – to make contact, to apply

ㄓㄠˊ – zhoek⁶ – チャク – 착 – trước – to ignite, to affect

ㄓㄠˉ – zhoek⁶ – チャク – 착 – trước – (boardgame) move

˙ㄓㄜ – zhoek⁶ – チャク – 착 – trước – stative particle

Visit to North Korea

In March of 2014, I went on a “March Madness” tour to the Democratic People’s Republic of Korea. I went on the same tour as Jan van der Aa and his sister, which was nice.

I had to travel through China, so I got a dual-entry visa and traveled to Peking. Although I wasn’t going to stay there for 72 hours, I couldn’t take advantage of the visa-free entry, because the Communist Party only let Norwegian citizens stay for 24 hours without a visa. I suspect they were angry about the Nobel Prize that went to Lau Hiubo (劉曉波). Thanks, Nobel Committee.

We had a briefing at Koryo Tours office, and left in a bus the next morning. I almost missed it, because I stupidly forgot my card in an ATM and ran back to retrieve it. Fortunately, the employees at Bank of China (中國銀行) were coöperative, so I got my card back and ran back to catch the bus.

When we got to the airport, we met our first north Koreans. They had just played in a football tournament, but were beaten by Portugal. It was very interesting to hear them call each other “comrade” (同志 to superiors, 동무 to everyone else). It was also fun to try talking to them in rudimentary Korean.

One of my favourite moments was when we boarded the plane. Leaving the generic-looking Chinese airport and being plunged into the atmosphere of the Air Koryo plane (from the Soviet era) was quite an experience. One of my favourite songs, My Country is the Best (내 사는 내 나라 第一로 좋아), was playing (and yes, they kept playing music until we landed). The stewardesses (all of them were female) were beautiful, and I got a seat right next to one of them. I didn’t get to take a picture with her, but since we got a newspaper that had Kim Jong-un on the front page (which every newspaper does), she did show me how to fold it in a respectful way. You can’t fold the leader of the country, so you have to fold the top and the bottom of the paper instead of the middle. The meal on the plane was something that looked like a hamburger. It was OK, I guess.

When we landed, we were reminded not to take pictures of any unfinished structures (finishing building projects quickly is apparently a matter of national pride), the military, or anything else that might put the country in a bad light. Other than that, we could take as many pictures as we liked.

We met a pair of guides, and when one of them said her name was Hwang (黃), she did it with a rising tone, so without thinking, I asked if she was Chinese. That was embarrassing. Fortunately, our guides turned out to be another pair, and they were great guides, too. Very friendly and good at their job. Both of them were called Kim (金).


On our way to Ryanggang Hotel (兩江호텔), we stopped at Pyongyang’s Arc of Triumph (凱旋門) and saw some kids on roller skates. While driving through Pyongyang, we could see the unfinished Ryugyong Hotel (柳京호텔) in the distance. Its exterior is finished, so we were allowed to take pictures of it.


We also passed the Chollima Statue (千里馬銅像), but never ended up visiting it.


The hotel was beautiful, and there was a bookstore in the lobby. By the end of the trip, I bought a book that was all about what a great man General Kim Jong-il was.


The room was one of the best I’ve ever stayed in, but I don’t usually stay in hotels anyway. It was very clean, and the bed was comfortable. The electricity kept going off, but that was convenient, because I couldn’t figure out how to turn off the lights. The downside was that I woke up every time the power came back. I found out how to control the lights eventually, and had a great night’s sleep.

Day 1

We had breakfast downstairs, and it included some great pancake things that I haven’t been able to find since. The waitress called them 알라지 (I had her check my spelling, too). The tea was also fantastic. I don’t think I’ve ever had tea that good.


As we drove through Pyongyang, one of the things I really liked was the absence of advertisements. There was a lot of slogans and calls to support the Party, but the only ad was for Pyonghwa Motors (平和自動車).


The bus took us to Kumsusan Palace of the Sun (錦繡山太陽宮殿), where the bodies of President Kim Il-sung and General Kim Jong-il lie in state. This was the most solemn part of the tour, and I had to wear a suit. We had to pass through several wind machines that blew off dust and hair from our clothes, and we walked through long corridors filled with art and other things commemorating the two leaders. We couldn’t take pictures inside, and we had to keep our arms straight down at all times. I absentmindedly put my hands behind my back at one point, and the other group’s guide scolded both me and our guide for it. Oops.

We got to see both of the leaders, and participated in a bowing ritual in order to walk around the body. First, we faced the feet and bowed. Then, we faced the body’s left side and bowed. Then, we faced the head without bowing. Then, we faced the right side and bowed before moving on to the next room. We did a lot of bowing on this trip.


Afterwards, we took pictures outside before moving on to the Revolutionary Martyrs’ Cemetary (大城山革命烈士陵), where, among others, Kim Jong-suk, President Kim Il-sung’s first wife, is buried.


Next, we visited the Mansudae Grand Monument (萬壽臺大紀念碑), where we bowed to the bronze statues of President Kim Il-sung and General Kim Jong-il. Any pictures of the statues had to include their whole body, not just part of it.


We could also see the Grand People’s Study House (人民大學習堂) on the way.


Finally got a passable picture of the Ryugyong Hotel!


These signs were everywhere, with varying messages and slogans on them. This one says 「先軍朝鮮의 太陽 金正恩將軍 萬歲!」 (“Long live the sun of Songun [‘military-first’] Korea”) in Korean letters, since Chinese characters are even rarer in the north than in the south.


Then, we visited President Kim Il-sung’s childhood home, where we saw this strange pot, which I’m sure I’ve seen on the Internet before.

Day 2


On the third day, we went to Kim Il-sung Square (金日成廣場), where the famous military parades are held.


There was no parade when we were there, and they are actually quite rare, but we did see the markings that make them possible.


This is the other side of the Grand People’s Study House, where the leader overlooks parades.


On the other side of the Taedong River (大同江), we could see the Juche Tower (主體思想塔).


We walked down the street to the foreign bookstore (外國文冊房). Our British tour leader set up the tour so that we could walk among normal people. We could also talk to them, though we didn’t have that much time.


Before going, I was told that, although we could pay in USD, RMB, or EUR, the best deals were in dollars. That was not the case. The fixed rate made EUR the best choice nearly 100% of the time, so I ended up borrowing Euros from Jan and pay him back in dollars later, using the actual rate. In this case, I bought a dictionary.


Afterwards, we went to the Pyongyang bowling hall. It was fun, and even though the power went out a few times, the machine still kept track of our scores.


Then, we drove to Kaesong (開城), where we stayed in a historical guesthouse. I opted not to shower there, since everything was old-fashioned, the water was cold, and I was lazy.


The food was great as usual.


There was also a dog soup option, which the Koreans call “sweet meat soup” (단고기국). The taste was OK, but I couldn’t finish it.

Day 3


From Kaesong, the Demilitarized Zone between north and south Korea was close by. We went to the Joint Security Area (共同警備區域) and the Peace Museum (朝鮮民主主義人民共和國平和博物館).


We even got to see some tourists on the south Korean side. We waved at them, but they were not allowed to make any moves back. It’s more relaxed on the northern side.


After that, we went back to Kaesong and up on a hill that overlooks this historical city.


We made sure to bow to this statue of President Kim Il-sung, of course.


We had some great food once again.


Then, we went to a museum. Apparently, it’s famous for its ginseng.


On the way back to Pyongyang, we stopped at this monument, the Arch of Reunification, or Monument to the Three-Point Charter for National Reunification (祖國統一三大憲章記念塔).


Then, we went to visit the Juche Tower. Too bad it was on the only cloudy day. The guides told us about the symbolism behind its measurements, and how it was completed ahead of schedule.


There were plaques for all the major donors to the tower.


I went to a public toilet, and took this picture on the way. The toilet was very clean, but I don’t have a picture of it.


It was too cloudy to get a good picture from the top, but there’s Kim Il-sung Square.


Next stop was the Monument to Party Founding (黨創建紀念塔). It was pretty cool, and the local guide was very friendly. I asked her how to say “the Workers’ Party of Korea” (朝鮮勞動黨) in Korean, because I wanted to hear whether she would say 「로동」 or 「노동」. She and another guide said 「조선노동당」 at the same time. Since I didn’t want the preceding /n/ to interfere, I repeated 「朝鮮…?」 to get them to say the next part. The man said 「로동당」, but the woman said 「노동당」, even though they were both from Pyongyang. Interesting.


Here’s Mansudae in the background. Lights go on at night to make sure the leaders’ faces are always lit up. Most of them stay on even when the power fails.

We went to visit an art gallery with lots of beautiful paintings. Many of them were of nature, but there were also quite a lot of revolutionary motifs. Mt. Paektu, the Arirang Mass Games, and a painting of the Young Pioneers that looks like an actual photograph! I also liked the painting of all the major monuments of Pyongyang.


And, of course, the President and the General.

Afterwards, we went to a brewery. I don’t drink, so it wasn’t that interesting, but the waitresses were cute.


On the way back to the hotel, I noticed that the Juche Tower actually lights up at night. Cool, but I couldn’t get a good picture of it.

Day 4


On the fourth day, we first went to Pyongyang Railway Museum (鐵道省革命史蹟館). It was mostly about the train travels of the President and General.


Most interesting to me were the old newspaper articles in mixed script. I took so many pictures of newspapers that I ended up lagging behind the group.


As always, there was a lot of information about President Kim Il-sung and General Kim Jong-il too. Not all of it had to do with trains, either.


They had a lot of old trains.


Next, we went to take the Pyongyang Metro (平壤地下鐵道). The escalator went deep down into the ground.


The metro has two or three lines; the Chollima Line (千里馬線) and the Hyoksin Line (革新線), and then there’s the Mangyongdae Line (萬景臺線), which seems to be an extension of the Chollima Line. You can press a button to activate lights that show you where to get on and off.


The stations on the Mangyongdae Line are very impressive. We started at Puhung Station (復興驛).


Then, we took the metro to Yonggwang Station (榮光驛). I was happy to hear another song I like, It is War (攻擊戰이다).


Finally, we took the metro all the way to Kaeson Station (凱旋驛), which is where the Arc of Triumph is. On the way, I sat next to a young girl who was studying Mandarin. Her English was good, too.


At Kaeson Station, this message says 「온 社會를 金日成-金正日主義化하자!」 (“Let’s Kimilsungism-Kimjongilism-ify the whole society!”) in Korean letters.

We got peach ice cream, or “eskimo” (에스키모) as the Koreans call it. It was good.


Next, we went to a factory museum. It had this interesting message: 「石炭은 工業의 食糧이다」 (“Coal is the food of industry”).


There were many old machines there.


Then, we passed one of the few things in Pyongyang to survive the Korean War, or Fatherland Liberation War (祖國解放戰爭), as they call it.


Next stop was the Grand People’s Study House, which we got to enter this time. Apparently, any citizen of the DPRK can use a computer with an Internet connection to order books from this grand library. I tried searching, but the Korean keyboard wasn’t indicated visually, and it wasn’t a layout I was familiar with.


Here’s the view of Kim Il-sung Square from the balcony, with the Juche Tower in the background.


Next, we went to my favourite museum: the Victorious Fatherland Liberation War Museum (祖國解放戰爭勝利紀念館).

They had lots of cannons, tanks, submarines, and planes.


I love the northern spelling of “tank” (땅크).


We also visited the captured American ship USS Pueblo.


And we watched this fantastic video. It puts a smile on my face every time.


Then, we entered the main museum. Unfortunately, we weren’t allowed to take pictures inside, but it was a great experience. There was a life-sized model of President Kim Il-sung greeting us in the lobby, and the building looked kind of like a luxury hotel.


I also saw these around the city. They say 「偉大한 金日成同志와 金正日同志는 永遠히 우리와 함께 계신다」 (“The great Comrade Kim Il-sung and Comrade Kim Jong-il are with us forever”) in Korean letters.


After the Victorious Fatherland Liberation War Museum, we drove through the countryside again.


On the way to Pyongsong, we could also see slogans in the countryside. This one says 「榮光스러운 朝鮮勞動黨 萬歲!」 (“Long live the glorious Workers’ Party of Korea!”) in Korean letters.


The hotel in Pyongsong was much smaller than the one in Pyongyang, but very cozy. The staff was super friendly. I was tired, so I went to bed early and missed the karaoke. Bummer, since I love north Korean music and would have loved to join.

Day 5


The next day, we went to a camp that served as a base of operation against the Japanese (or Americans … I forgot which).


On the way back to Pyongyang, we stopped at a candy factory. I bought four bottles of soda, because they were only 5 RMB each. The strawberry one was the best, but the peach one was OK too. The other two weren’t that good.


We also visited Kim Jong-suk Middle School No. 1 (金正淑 第一 中學校). I sang Chollima on the Wing (千里馬 달린다) and No Motherland Without You (當身이 없으면 祖國도 없다) with one of the classes.

There was a strange taxidermy room. The animals were kind of weird.


We left Pyongsong and headed to a “village” of movie backgrounds. There were areas for ancient Asia, South Korea, China, and more.

It was cool to walk through.


I saw only one church building on the trip. Apparently, it is in use.

Then, we went to another art gallery. I talked to a multilingual woman who knew French, among other languages.


After that, we went to visit the famous Yanggakdo International Hotel (羊角島國際호텔).

We had to visit the top floor with the revolving restaurant, of course.


After that, it was time to take the train back to China.


Got a few good pictures of the Korean countryside.


Finally, we reached Tantung (丹東). It was an interesting feeling. I felt safe in the DPRK, but not very free, since we had to stay with the guides at all times.


The train we travelled with didn’t have any toilet paper, but I had brought my own just in case.


One of the guards also had this strange mix of simplified and normal characters on his shoulder. All the other ones had it all in simplified.


I enjoyed the train ride, although it was about 24 hours long from Pyongyang to Peking.

I skipped a couple of things, like the grave of an ancient king near Kaesong and the store selling lots of imported goods, but I didn’t get any good pictures there. I hope I can visit some other places if I come back in the future!

Cantonese Goldlisting Project

My goldlisting project for this year is an average of 100 lines per day (but I’m currently 18 days ahead of schedule), and the language I’m goldlisting is Cantonese. My goal is 15 000 items, but because I need extra lines for readings, I’ll need to reach about 20 000 headlist lines.

Source and Method

I’m using a great list of characters encountered in the Taiwanese school system, finding the CantoDict page for one character at a time, and goldlisting all the “compounds” that seem worthwhile. This usually means I’ll skip:

  • Things I don’t understand the English translation of.
  • Transparent compound words.
  • Proper nouns that I don’t recognize.

The first few characters in the list have huge lists of words containing them, and the last ones may have only one or two. Some are not even in the dictionary, and others are there, but with no words. So in the beginning, I will be using the same list for a long time, but the further I get, the shorter the lists get. In addition, I keep track of what characters I’ve already goldlisted, so I skip any words containing characters I’ve already done, which further shortens the lists.


In calculating project sizes in the goldlist system, David James multiplies the amount of headlist lines by three to get an approximate number of lines. In my case, the entire project should have 60,000 lines in total, although in reality it will likely have less than that. I’m currently at 10,300 headlist lines, but I’ve already started distilling, so I have about 8,000 lines in my distillations too. If I see every item approximately 3 times on average, I’ve finished about 1/3 of the project.

If I keep going at 100 lines per day, the remaining ~40,000 lines should take me ~400 days. But if I can manage to keep going at the current rate of 200 lines per day, I’ll need only ~200 days, which means I could finish before next year, if work and other circumstances allow it.


Initially, I planned on doing the project in 20 batches of 1,000 headlist lines each, and just finish them one by one, but I found out about David’s superior batch system early on and decided to use that instead. If the numbers look scary, don’t worry; I felt completely overwhelmed when I looked at them too. But it’s actually very easy:

  • Write your headlist for the first batch (in my case 2,000 lines).
  • Distil the headlist from 1–2,000, and and continue making the headlist from 2,001–3,900.
  • Return to the beginning of the book, where your D1 (first distillation) starts, and distil it. Now distil the second headlist batch, and finally continue making the headlist from 3,901–5,700.
  • Return to the beginning of the book, where your D2 starts. Distil your way through the book again, distilling one page at a time. Now continue making the headlist from 5,701–6,300.
  • Return to the beginning of the book, where your D3 starts. Distil your way through the whole book. Add 1,500 lines to your headlist.
  • Return to the beginning of the book, where your D4 starts. Distil your way through the whole book. Add 1,400 lines to your headlist.
  • Return to the beginning of the book, where your D5 starts. Distil your way through the whole book. Add 1,300 lines to your headlist.
  • Return to the beginning of the book, where your D6 starts. Distil your way through the whole book. Add 1,200 lines to your headlist.
  • And so on.

Of course, you won’t find a single book that can fit everything, so since my books have 100 sheets and are 35 lines deep, I can fit 2,500 headlist words with three distillations in each if I use the very last page and the very first as if they were a double page. After D3, I have to sample from several pages to make a new list of 25 lines per page in a new book. We call the first book “bronze” and the second “silver” – the next is “gold” and then even “platinum” if you want to keep going, but you may not need to continue once you finish the silver book.

Having explained the system I planned on using, I have to say I ended up not sticking to it after all, and there’s a good reason for that. I can only do the headlist when I have access to my source, and my source is online. Therefore, I decided to do the headlist when I can, since the other distillations can be done anywhere (except when I sample from one book to put into another, but at least I don’t need to be online for that. But I think I will follow the batch system for distillations, and so far, I have.

Books and Pens

I use these 100-page, 35-line Kokuyo Campus notebooks:


Each one can fit 2,500 headlist lines, so I’ll need 8 of them at the bronze stage, and probably 2 at the silver stage. I might just use a smaller book at the gold stage.

I don’t have a specific type of pen that I use, but I try to use comfortable ones. I prefer 0.38 mm., but 0.5 mm. pens are OK as well. If you want to try this at home, make sure you stock up on pens, because this really eats them up! I like to use black for the headlist, blue for D1, red for D2, and green for D3, then black for D4, and so on, rotating the colours, but you can do it with any colour you like.