Please read the following text for background:
Linguistics: the study of language, language meaning, and language context
A document contains characters which form words that form terms that form concepts.
Linguists say characters are either a phoneme or a part of a phenome, the most basic element that a language is based on.
A words is the smallest element that when isolated has meaning.
Terms are words or compounded words such as "milk" or "ice cream"
A concept is a general idea
Because humans do not put out any more exertion than is needed, our language is synoptic meaning that every signal counts.
A quick computer science glossary for quick review
A regular expression is a way of matching a string of text
An algorithm is the step-by-step or systematic processes of how to do a compilation task