Download token types

NAME CONTENT(REGEX) COMMENTS
TIME_REFERENCE (?i)(?:((A\.?M\.?|P\.?M\.?)/[^\s]+)|((GMT|DST|BST|EST|UTC|SGT)/[^\s]+)|((A\.?D\.?|B\.?C\.?)/[^\s]+)) Some temporal references such as time zone and the Christian era
DAY_TIME (?i)(?:(morning|noon|afternoon|evening|dusk|twilight|dawn|daybreak|sunrise|sunup|sunset|sundown|midday|midnight|teatime|lunchtime|dinnertime|suppertime|daylight|daybreak|daytime|nighttime|night|overnight|tonight)/[^\s]+) Some periods within a day
WEEK (?i)(?:(Monday|Tuesday|Wednesday|Thursday|Friday|Saturday|Sunday|Mon\.?|Tues?\.?|Wed\.?|Thurs?\.?|Fri\.?|Sat\.?|Sun\.?)/[^\s]+) Seven days of a week
MONTH (?i)(?:(January|February|March|April|May|June|July|August|September|October|November|December|Jan\.?|Feb\.?|Mar\.?|Apr\.?|May\.?|Jun\.?|Jul\.?|Aug\.?|Sept?\.?|Oct\.?|Nov\.?|Dec\.?)/NNP?S?) Twelve months of a year
SEASON (?i)(?:(spring|summer|autumn|fall|winter)/NNP?S?) Four seasons of a year
YEAR (?i)(?:[^\s]+/YEAR) Actually the regex "[1-2][0-9]{3}|'[0-9]{2}", representing a particular year
TIME (?i)(?:[^\s]+/TIME) Actually the regex "[0-2]?[0-9]:[0-5]?[0-9]:[0-5]?[0-9]|[0-2]?[0-9]:[0-5]?[0-9]", representing the clock time
DATE (?i)(?:[^\s]+/DATE) Actually the regex "(?:[1-9]|[0-3][0-9])[-/.](?:[1-9]|[0-3][0-9])[-/.][1-2][0-9]{3}|[12][0-9]{3}[-/.](?:0?[1-9]|1[0-2])[-/.][0-3][0-9]|[1-9]{2}[-/.](?:0?[1-9]|1[0-2])[-/.][0-3][0-9]", representing dates like "1949-10-01"
DECADE (?i)(?:[^\s]+/DECADE) Actually the regex "twenties|thirties|forties|fifties|sixties|seventies|eighties|nineties|([1-2]?[0-9])?[0-9]0s", representing a decade of a century
HOLIDAY (?i)(?:[^\s]+/HOLIDAY) Actually the regex "christmas|xmas|easter|halloween|thanksgiving|new year|election day|inauguration day|groundhog day|valentine day|patrick day|memorial day|columbus day|father's day|mother's day|valentine's day|patrick's day|fools' day"
TIME_UNIT (?i)(?:(era|millennium|century|decade|year|month|fortnight|week|weekday|weekend|day|hour|minute|millisecond|o'clock|quarter|second)/NNP?S?) The unit of time
TIMELINE (?i)(?:(now|currently|recently|previously)/RB) Some adverbs representing time
MOMENT (?i)(?:(moment|past|future)/NNP?S?) Words "moment", "past" and "future"
DAYPOINT (?i)(?:(yesterday|today|tomorrow)/NNP?S?) Days containing "yesterday", "today" and "tomorrow"
PERIOD (?i)(?:(centennially|yearly|annually|hourly|nightly|daily|weekly|monthly|quarterly)/RB) Something happens regularly
EARLY_LATE (?i)(?:(earlier|later)/RBR?|(before|after|ago|early|late)/RB) Words which represent early or late
POSITION (?i)(?:(start|beginning|middle|end|rest)/NNS?) Position of a period or an event
EXTENSION (?i)(?:so/RB|beyond/IN) An extension of time
LINKS (?i)(?:(mid|previous|last|current|next|following|coming|upcoming|past|future|initial|final|late|early|latter|former|present|medieval|recent)/JJ|(earlier|later)/JJR|(earliest|latest)/JJS) Some adjectives in common use
ORDINAL (?i)(?:([2-9]?1st|[2-9]?2nd|[2-9]?3rd|[4-9]th|1[0-9]th|[2-9]0th|[2-9][4-9]th|100+th|zeroth|first|second|third|fourth|fifth|sixth|seventh|eighth|ninth|tenth|eleventh|twelfth|twentieth|thirteenth|fourteenth|fifteenth|sixteenth|seventeenth|eighteenth|nineteenth|hundredth|thousandth|(twenty|thirty|forty|fifty|sixty|seventy|eighty|ninety)-(first|second|third|fourth|fifth|sixth|seventh|eighth|ninth))/(JJ)) Ordinal numbers
FREQUENCE (?i)(?:(each|every|per)/[^\s]+) Representing regularity
APPROXIMATION (?i)(?:(around|about)/IN|(roughly|approximately|nearly|almost)/RB|(approximate|near)/JJ) Representing approximation
DEFINITE_DETERMINER (?i)(?:(the|this|that|these|those)/W?DT) Definite determiner
INDEFINITE_DETERMINER (?i)(?:(some|many|another|any|all)/P?DT) Indefinite determiner
INDEFINITE_QUANTITY (?i)(?:[^\s]+/IQ) Actually the regex "a few of|a couple of|a dozen of|a few|a couple|a dozen|several|few|couple|dozen"
NUMBER (?i)(?:[^\s]+/CD) Numbers
SUCCESSIVE_RELATION (?i)(?:(before|after)/IN) Prepositions "before" and "after"
CONJUNCTION (?i)(?:(and|or)/CC) Conjunctions "and" and "or"
RELATION (?i)(?:to/(IN|TO)|(from|between|during|as|in|on)/IN) Prepositions which represent relations
OF (?i)(?:of/[^\s]+) The word "of"
POS (?i)(?:('s/[^\s]+)|([^\s]+/POS)) Possessive case
INEQUALITY (?i)(?:[^\s]+/INEQ) Actually the regex "no more than|no less than|at least|at most|more than|less than|up to|close to|over|a mere"

back