I Can't Read This
Genealogy 101: A Practical Guide to Deciphering Old Documents
GENEALOGY 101 | GENEALOGY BY ARYN
Practical Skills for Every Researcher
You found it.
Finally! After hours of searching, or maybe days, weeks, months, or even years! There it is—the document you were looking for.
Your great-grandmother’s death certificate.
Your ancestor’s immigration record.
A letter in a handwriting so dense and looping that it looks more like a landscape than a language. A page of characters you don’t recognize at all.
And now you can’t read it.
The moment is equal parts triumph and frustration, and it’s one of the most common experiences in genealogical and family history research.
The record exists.
You thought outside the box, and finally it.
But it’s locked behind a script you never learned, a language your family stopped speaking two generations ago, or the particular handwriting of a 19th-century government clerk who apparently thought legibility was optional.
But here is the good news:
Document decipherment is a learnable skill.
There are free tools, active online and in-person communities, proven strategies, and resources that can crack almost anything. In this installment of Genealogy 101, I will walk you through all of it, from wobbly English handwriting to Chinese characters to Latin church records, so that the next time a document stops you cold, you know exactly where to turn.

Part One: Old English Handwriting
Let’s start with the records most researchers encounter first:
Documents written in English, but in a handwriting style so different from modern writing that they’re nearly illegible.
Census records
Vital records
Wills
Deeds
Military records
Personal letters
These types of documents from the 18th, 19th, and early 20th centuries were written in scripts that were standard in their time but look foreign today.
Why Old Handwriting Is So Hard to Read
Several factors make historical English handwriting difficult for modern readers.
The first is script style.
Before the widespread adoption of standardized print-style writing in the early 20th century, most literate people wrote in one of several cursive scripts — copperplate, Spencerian, secretary hand, or older styles used in colonial-era documents.
Each of these styles came with its own conventions for letter formation. Letters that look like one thing in modern handwriting look entirely different in these scripts.
The second factor is ink and paper degradation.
Old documents have often faded, foxed (developed brown spots from aging), torn, or been damaged by water. What was once a clear stroke may now be a shadow.
The third factor is individual variation.
Every clerk, minister, enumerator, and letter-writer had their own hand. A capital F in one person’s writing might look like a capital J or T in another’s. Context is everything.
The Most Important Thing You Can Do: Learn the Alphabet
Before you try to read any old document, find an alphabet guide for the script style you’re dealing with. Many historical scripts have standard letter forms that, once you learn them, make the rest of the document far more legible. You don’t need to master the script. You just need to recognize the letters.
FamilySearch has published free guides to historical handwriting styles organized by country and time period, including detailed letter-by-letter alphabet charts. These are the single most practical starting resources for any researcher encountering an unfamiliar script.
Strategy: Work Around the Words You Can’t Read
When a word won’t resolve, don’t stare at it—work around it.
Read the words you can read first and use context to narrow down what the unclear word must be. A county clerk writing a death certificate in 1887 had a limited vocabulary of standard terms.
If you can read “cause of” and “aged” and “female,” the words in between become much easier to infer.
Also, look for the same word elsewhere in the document or in similar documents from the same source. Clerks were consistent. If you can find a legible example of the word “consumption” in one record and match it to the illegible one in yours, you’ve solved it.
Strategy: Transcribe Exactly What You See, Then Interpret
One of the most common mistakes beginners make is trying to read and interpret at the same time. Instead, make two passes.
On the first pass, write down exactly what you see—letter by letter, even if the result looks like nonsense.
On the second pass, look at that phonetic transcription and ask what word it could be. “Consupmtion” becomes “consumption.” “Pensilvania” becomes “Pennsylvania.” “Jno” — a standard abbreviation you’ll encounter constantly — is “John.”
Common abbreviations worth memorizing:
Jno = John
Wm = William
Thos = Thomas
Richd = Richard
Chas = Charles
Eliz or Eliza = Elizabeth
do. or ditto = same as above (appears frequently in census and ledger records)
inst. = of this month (”the 14th inst.” means the 14th of this month)
ult. = of last month
aet. or aetat. = aged (from Latin aetatis)
Transkribus — AI-Powered Handwriting Recognition
Transkribus is an AI-assisted transcription platform developed by the READ-COOP cooperative in Europe and used by archives, universities, and individual researchers worldwide. You upload a document image, select a trained model for the script type you’re working with, and the platform produces a machine-generated transcription that you can then correct.
Transkribus is not perfect, no AI handwriting recognition is, but it is remarkably good on many common historical scripts, and it can produce a working draft transcription in minutes that would take hours by hand. It requires a free account and uses a credit system; casual users get enough free credits to work through a meaningful number of documents.
If you don’t want to pay for Transkribus — try Claude or Gemini. Sometimes, all we need is a jumping-off point. By giving us a head start, AI can remove the anxiety associated with the difficulty of reading old scripts.
Crowdsourced Transcription Communities
Sometimes the best tool is another human who has seen thousands of documents like yours. There are several online communities that exist specifically to help researchers decipher stubborn handwriting:
Genealogy Translations (Facebook group) — a large, active community where you can post a document image and receive transcription help, often within hours. Covers handwritten English as well as many foreign languages.
r/Genealogy (Reddit) — the Deciphering Help thread within this subreddit regularly helps researchers with handwriting questions. Search before posting; your exact document type may have been discussed before.
Ancestry Community Forums — Ancestry’s built-in community forums include transcription help threads organized by record type and country.
FamilySearch Community — similar community forum structure with active volunteer helpers.
There are others, like genealogical societies or even local libraries, that may be able to offer you help deciphering old documents. Look around where you live, search online, or even ask on here (in the comments below) — you never know who you may find to help you out.
Documents in Other Languages
While old handwriting in a language you know may be one wall to climb, it’s important to remember you may locate one in a language you don’t know.
Your first reaction may be, all hope is lost, but finding a document in a language you don’t speak is not necessarily a dead end. It is an extremely common research situation, and there are tools and strategies for virtually every language you’re likely to encounter in genealogical research.
The approach varies by language, so this section covers the most frequently encountered ones for researchers in the United States.
First: Identify the Language
This sounds obvious, but it’s worth stating explicitly—before you can translate a document, you need to know what language it’s in. For many researchers, this is straightforward, but historical documents sometimes surprise you.
A family from Alsace might have records in German, French, or both, depending on which country controlled the region when the document was created.
A Filipino ancestor’s church records might be in Spanish.
A Jewish ancestor’s records from Eastern Europe might be in Russian, Polish, Yiddish, Hebrew, or some combination.
If you genuinely can’t identify the language, the Genealogy Translations Facebook group can usually identify it within minutes from a photo. You can also use Google Translate’s camera feature on your phone or in your browser to attempt to identify the language even before translating.
Spanish Records
Spanish-language documents are among the most common non-English records encountered by American genealogists and family historians, particularly for families with roots in Latin America, the Philippines, the American Southwest, or California (where Spanish colonial and mission records predate American statehood). California mission records, in particular, are a largely accessible source for families with deep roots in the region.
Spanish genealogical documents follow consistent structures and use a relatively limited vocabulary of standard terms. Learning a core list of genealogical Spanish terms:
Bautismo (baptism)
Defunción (death)
Matrimonio (marriage)
Padrinos (godparents)
Legítimo/a (legitimate)
Natural de (native of)
These terms will allow you to pull the key information from most vital records, even with minimal Spanish.
German Records
German-language records are the second most common foreign-language documents encountered by American genealogists after Spanish, due to the size of 19th-century German immigration to the United States.
German genealogical records are complicated by several factors — the Gothic (Kurrent and Sütterlin) handwriting scripts used in Germany until the mid-20th century are radically different from both modern German handwriting and from any English script; German records were kept by individual German states rather than a central government, so record types and availability vary significantly by region; and many records are held in German regional archives that are not fully digitized.
The Kurrent script, in particular, is a serious obstacle for most English-speaking researchers. It looks almost nothing like modern Roman letters, and attempting to read it without an alphabet guide first is a frustrating exercise.
Here are two resources to help you out:
Kurrent Handwriting Alphabet Chart Free letter-by-letter alphabet guide for Kurrent and Sütterlin scripts, the two Gothic scripts used in German records.
Archion and Matricula Two major databases of digitized German church records. Archion requires a paid subscription; Matricula is free for many collections.
Latin Records
If your family has roots in Catholic communities anywhere in the world—Italy, Ireland, Poland, Mexico, the Philippines, Latin America—there is a good chance you will eventually encounter records written in Latin.
Catholic church records (baptisms, marriages, burials) were kept in Latin well into the 19th century in many regions, and some parishes continued using Latin into the 20th century.
The good news about Latin genealogical records is that they are highly formulaic. The same phrases appear in the same positions in every baptismal register, every marriage record, every burial entry. You do not need to learn Latin as a language to work with these records. You need to learn a relatively small set of recurring Latin phrases and a handful of grammatical conventions.
Key Latin genealogical terms to memorize:
baptizavi / baptizatus est = I baptized / was baptized
natus/nata = born (male/female)
filius/filia = son/daughter
legitimus/legitima = legitimate
patrini = godparents
conjuges = spouses / married couple
sepultus/sepulta = buried (male/female)
obiit = died
uxor = wife
vidua/viduus = widow/widower
annorum = of years (”annorum 45” = aged 45)
FamilySearch Latin Genealogical Word List Free. Comprehensive glossary of Latin terms found in Catholic and civil records, with English translations.
Chinese Records
Researching Chinese-language documents is a distinct challenge because it requires not just translation but knowledge of which type of Chinese writing system you’re dealing with, which dialect the document was produced in, and what era it dates from—all of which affect how characters should be read and interpreted.
For most Chinese American researchers, the documents you’ll encounter fall into a few categories:
Jiapu (clan genealogy books, typically in Classical Chinese),
Immigration interrogation transcripts (in English but referencing Chinese place names and personal names),
Coaching books (handwritten in Cantonese or standard Chinese), and
Official documents from the Exclusion Era, such as Certificates of Identity
For jiapu and other Classical Chinese documents, Google Translate’s camera function is a reasonable first pass. It reads simplified and traditional Chinese characters and handles classical text with variable but often useful accuracy.
For more precise translation, the following resources are specifically calibrated for genealogical Chinese:
My China Roots Genealogy research service with particular expertise in Cantonese and Guangdong Province records. Offers translation services and village record research.
FamilySearch Chinese Records Collection Digitized jiapu and civil records, with an ongoing indexing project. Most useful for researchers with Chinese-language reading ability or a translator.
Japanese Records
Japanese genealogical documents present a particular challenge because Japanese uses three writing systems: hiragana, katakana, and kanji, and older documents use a classical form of Japanese with archaic characters and grammatical structures not found in modern Japanese.
The koseki (family registry system), which is the primary genealogical record source for Japanese families, is written in formal Japanese with standard structures that become readable with practice, but only with some Japanese-language foundation or specialist help.
For researchers without Japanese-language skills, the most practical approach is to identify the specific record type first, then use a targeted resource:
Densho Encyclopedia — Japanese American Records Comprehensive, free reference on Japanese American history and records, including guidance on interpreting specific document types from the incarceration era and earlier.
Japan GenWeb (WorldGenWeb) Links to databases of Japanese immigrants to the U.S. between 1887 and 1924, and guidance on Japanese records by prefecture.
Google Translate Camera Function Reasonably effective on modern Japanese; less reliable on classical or archaic forms. Use as a first pass, not a final translation.
Italian Records
Italian genealogical records are among the most accessible of any non-English language for American researchers, for two reasons: Italy has a highly organized civil registration system dating from the early 19th century, and a large proportion of Italian vital records have been digitized and made available through Antenati — the Italian National Archives portal — for free.
Italian records use a consistent formulaic language that is relatively easy to work through with a basic glossary, even without Italian-language skills. The most important thing to know about Italian records is that civil registration in Italy began in different years depending on the region: in the south and Sicily (where the majority of Italian Americans’ ancestors originated), Napoleonic-era civil registration began around 1809, and records from that period onward are generally available.
Antenati — Italian National Archives Free access to digitized Italian civil vital records from most provinces. Organized by comune (municipality). One of the most powerful free genealogy resources in the world.
FamilySearch Italian Genealogical Word List Free glossary of Italian genealogical terms with English translations.
Yiddish, Hebrew, and Eastern European Jewish Records
For researchers tracing Jewish ancestry from Eastern Europe, documents may appear in Yiddish (written in Hebrew characters but a distinct language), Hebrew (used in religious documents), Russian, Polish, Ukrainian, Romanian, or German, depending on the country and era.
The record landscape for Eastern European Jewish genealogy is complex but has been dramatically expanded by several dedicated organizations.
JewishGen The primary international resource for Jewish genealogy. Includes databases, town finders, translation tools, and active research communities. Free membership with access to most tools.
YIVO Encyclopedia of Jews in Eastern Europe Free scholarly encyclopedia of Eastern European Jewish history. Invaluable for understanding the historical context of specific towns, regions, and record systems.
National Yiddish Book Center: Translation Resources Resources for Yiddish-language documents including glossaries and translator referrals.
General Tools That Work Across Languages
Beyond language-specific resources, several general tools are useful for deciphering documents regardless of what language they’re in.
Google Translate — Camera Mode Is Your First Pass
The Google Translate mobile app’s camera function, accessed by tapping the camera icon, lets you point your phone at text and see a real-time translation overlay. This is not a precision tool.
It makes mistakes, struggles with handwriting, and is inconsistent in its use of archaic language. But it is free, instant, and often good enough to tell you what type of document you’re looking at and what the basic content is, which is all you need to decide whether to invest more time in a deeper translation.
For typed or printed documents in modern scripts, Google Translate's camera mode is often surprisingly accurate. For handwritten documents, treat their output as a rough draft to be verified rather than a finished translation.
DeepL — Better Than Google for European Languages
DeepL (deepl.com) is a machine translation service that consistently outperforms Google Translate for accuracy on European languages, particularly German, French, Italian, Spanish, Portuguese, Polish, and Dutch. If you have a typed or printed document in one of these languages and you want a higher-quality machine translation, DeepL is the tool to use. It is free for documents up to a certain length, with paid plans for heavier use.
Note that DeepL, like Google Translate, is calibrated for modern language usage and struggles with archaic, classical, or highly formulaic historical text. It is most useful for personal letters, typed official documents from the 20th century, and any historical document written in a relatively modern register.
FromThePage — Collaborative Transcription Platform
FromThePage (fromthepage.com) is a platform used by archives, libraries, and universities to manage collaborative transcription projects. Many major institutions — including the Smithsonian, the National Archives, various state historical societies, and university libraries — use FromThePage to host transcription projects in which volunteers work through their collections, document by document.
This matters to you as a researcher for two reasons. First, if an institution has a transcription project for a collection you’re researching, the documents may already be partially or fully transcribed — search the FromThePage public project list before starting from scratch. Second, the platform’s interface is excellent for working through your own documents, and some institutions allow researchers to create private projects.
The Handwriting Wizard — Letter-by-Letter Practice
The Handwriting Wizard (handwritingwizard.com) and similar tools offered by some genealogy societies walk you through historical handwriting styles with interactive exercises. Rather than just showing you an alphabet chart, these tools give you practice documents and ask you to identify letters and words. The learning is significantly faster when it’s active rather than passive, and fifteen minutes of structured practice with a historical script will do more for your ability to read it than an hour of staring at a confusing document.
The Document’s Own Context: Your Best Decoding Tool
No tool in this list is more powerful than the document itself. Specifically, its own internal repetition. Historical records repeat. A census enumerator covering a township wrote the same county name, the same birthplace, and the same occupation hundreds of times. Find a word you can read clearly, then find the same word written elsewhere in the document by the same hand. Use those clear examples as your cipher key.
Similarly, look for names you already know. If you’re searching for your ancestor Maria Consuelo Rodriguez and you find her name written clearly in one place, use those letter forms to decode other words in the same document.
Every legible word is a key that unlocks others.
Putting It Together
When you encounter a document you can’t read, work through it in this order:
Step 1 — Identify the document type and language. What kind of record is this? What country and time period is it from? What language is it likely in? This determines which tools you reach for.
Step 2 — Find the right alphabet or vocabulary guide. Before reading the document, equip yourself with a reference for the script or language. FamilySearch has guides for the most common scripts and languages.
Step 3 — Transcribe what you can see, exactly as you see it. Don’t interpret yet — just capture. Use brackets for letters you’re uncertain about: [c?] means “this looks like a c, but I’m not sure.”
Step 4 — Use context to fill in gaps. Read the words you can read, use the document type’s standard vocabulary, and let the surrounding text guide your interpretation of unclear words.
Step 5 — Use a machine translation tool to produce a first-pass draft of foreign-language documents. Google Translate camera, DeepL, or the FamilySearch word list for the specific language.
Step 6 — Post to a transcription community if you’re still stuck. The Genealogy Translations Facebook group, r/Genealogy, and the FamilySearch community forums can usually resolve a stubborn word or phrase within hours.
Step 7 — Cross-reference your translation against related documents. A name or place that appears unclear in one document may be spelled out legibly in another from the same family or the same archive.
Every word you successfully read is a key that unlocks the next one. Start with what you can see, and let the document teach you the rest.
Save Your Work
When you successfully transcribe or translate a document, save both the original image and your transcription together, labeled clearly. Future researchers — including you, three years from now, when you’ve forgotten what you figured out — will need both. Store the image in its original resolution. Write your transcription in a plain text or word processing document kept in the same folder. Note the source, the date you accessed it, and any uncertainties in your transcription.
The work of deciphering a document is often painstaking. It deserves to be preserved in a form that makes it usable.
And when you hit a word that truly won’t yield, when you’ve tried every tool on this list, and the handwriting is still defying you, write it down as your best guess in brackets, note your uncertainty, and keep moving. One illegible word in an otherwise legible document is not a failure. It is an open question, and open questions in genealogy have a way of answering themselves when you least expect it.
Resources at a Glance
FamilySearch Handwriting & Language Guides — familysearch.org/en/wiki/Historical_Handwriting
Transkribus (AI transcription) — transkribus.eu
Genealogy Translations (Facebook group) — facebook.com/groups/genealogytranslations
DeepL Translator — deepl.com
FromThePage (collaborative transcription) — fromthepage.com
Antenati (Italian archives) — antenati.san.beniculturali.it
JewishGen — jewishgen.org
Densho Encyclopedia (Japanese American records) — encyclopedia.densho.org
My China Roots (Chinese/Cantonese records) — mychinaroots.com
Google Translate App (camera mode) — free, iOS and Android and online
r/Genealogy Deciphering Help — reddit.com/r/Genealogy
Have a document that’s got you stumped? genealogybyaryn.com
Genealogy by Aryn | genealogybyaryn.com | Board Member, Los Angeles City Historical Society | Member of the National Genealogical Society & the Southern California Genealogical Society












This is a wonderful resource! While I have used some of these you have provided language sources that I need for helping someone in researching their family & will find the Japanese helps very useful. Thanks for providing such a complete guide. It’s going into my resource notebook!
Doris Miller-Yetton
Thanks for this resource. I'm putting together my own blog post about transcription and will include this one in the list of resources.