UAST: Unicode Aware Sanskrit Transliteration

03/27/2022
by   Aneri Dalwadi, et al.
0

Devanagari is the writing system that is adapted by various languages like Sanskrit. International Alphabet of Sanskrit Transliteration (IAST) is a transliteration scheme for the romanization of the Sanskrit language. IAST makes use of diacritics to represent various characters. On a computer, these are represented using Unicode standard which differs from how the Sanskrit language behaves at a very fundamental level. This results in an issue that is encountered while designing typesetting software for Devanagari and IAST. We hereby discuss the problems and provide a solution that solves the issue of incompatibilities between various transliteration and encoding schemes. Implementation and source code available at https://github.com/dhruvildave/uast

READ FULL TEXT

page 1

page 2

page 3

page 4

research
01/25/2021

MadDog: A Web-based System for Acronym Identification and Disambiguation

Acronyms and abbreviations are the short-form of longer phrases and they...
research
09/11/2023

Tortoise: An Authenticated Encryption Scheme

We present Tortoise, an experimental nonce-based authenticated encryptio...
research
06/11/2023

E(2)-Equivariant Vision Transformer

Vision Transformer (ViT) has achieved remarkable performance in computer...
research
07/11/2023

Duncode Characters Shorter

This paper investigates the employment of various encoders in text trans...
research
01/17/2022

OmniPrint: A Configurable Printed Character Synthesizer

We introduce OmniPrint, a synthetic data generator of isolated printed c...
research
06/18/2021

A transformation-based approach for solving stiff two-point boundary value problems

A new approach for solving stiff boundary value problems for systems of ...
research
05/04/2023

What changes when you randomly choose BPE merge operations? Not much

We introduce three simple randomized variants of byte pair encoding (BPE...

Please sign up or login with your details

Forgot password? Click here to reset