papers.adligo.com

StringNormalization

Author: Scott Morgan
Created: 2025-11-25
Edited: 2025-12-02
Id: 1.3.6.1.4.1.33097.5.1
Copywrite 2025 Adligo Inc

Where RelationalNormalForms#1.3.6.1.4.1.33097.5.0 are mostly concerned with de-duplicating arbitrary large Datums#1.3.6.1.4.1.33097.5.0.1.6. StringNormalization is concerned with de-duplicating individual strings of characters.

1SNF String Normal Form

1.3.6.1.4.1.33097.5.1.1

Normalization of Strings is a simple two step process;

a.trim()
b.toLowerCase()

Example of working code

The following code can be executed to illustrate this concept at https://jsfiddle.net/;


let a = 'John '
let b = ' John'
let c = 'John'
let d = 'john'

if (a === b) {
  console.log('a === b is True');
} else {
  console.log('a === b is False');
}

if (a === c) {
  console.log('a === c is True');
} else {
  console.log('a === c is False');
}

if (a === d) {
  console.log('a === d is True');
} else {
  console.log('a === d is False');
}

// you get the idea
if (a.trim().toLowerCase() === d) {
  console.log('a normalized === d is True');
} else {
  console.log('a normalized === d is False');
}

// you get the idea
if (b.trim().toLowerCase() === d) {
  console.log('b normalized === d is True');
} else {
  console.log('b normalized === d is False');
}

Picture of Code Example Running in a jsfiddle.net browser page

1WNF Word Normal Form

1.3.6.1.4.1.33097.5.1.2

Often when preparing string data for inclusion in generative AI systems the punctuation (i.e. ‘.’,’?’,’!’, etc) also needs to be removed from 1SNF string normal form.

Questions Comments:

Citations