Allows UTF-8 characters in the ranges 0020-007E, 00A0-024F and 1E00-1EFF, which are Basic Latin (excluding the C0 control characters), Latin-1 (excluding the C1 control characters), Latin Extended A, Latin Extended B, and Latin Extended Additional.
Complex pattern for postcode, which matches definition, accepted by some parsers is: "(GIR 0AA)|((([A-Z-[QVX]][0-9][0-9]?)|(([A-Z-[QVX]][A-Z-[IJZ]][0-9][0-9]?)|(([A-Z-[QVX]][0-9][A-HJKSTUW])|([A-Z-[QVX]][A-Z-[IJZ]][0-9][ABEHMNPRVWXY])))) [0-9][A-Z-[CIKMOV]]{2}). Less restrictive HESA format is [A-Z]{1,2}[0-9R][0-9A-Z]?( [0-9][ABD-HJLNP-UW-Z]{2})?."