TSV (Tab-Separated Values)
TSV is a tabular data format where columns are separated by tab characters instead of commas. It avoids the quoting complexities of CSV since tabs rarely appear in data values, making TSV simpler to parse for many bioinformatics and data science workflows.
MIME Type
text/tab-separated-values
Type
Text
Compression
Lossless
Advantages
- + Simpler parsing than CSV: no quoting rules needed
- + Tabs rarely appear in data, reducing ambiguity
- + Standard in bioinformatics and scientific data pipelines
Disadvantages
- − Less universally recognized than CSV in business tools
- − Cannot represent values containing tab characters
- − Some spreadsheet tools default to CSV, not TSV
When to Use .TSV
Use TSV when data values frequently contain commas, or in scientific and bioinformatics pipelines where TSV is the convention.
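As a rough illustration of the comma case, the sketch below uses Python's standard csv module with a tab delimiter and quoting disabled. The sample rows and variable names are invented for this example; it assumes no field contains a tab, newline, or double quote.

```python
import csv
import io

# Hypothetical sample data: the description field contains a comma.
rows = [
    ["gene", "description"],
    ["BRCA1", "DNA repair, tumor suppressor"],
]

# Write with a tab delimiter and no quoting. The comma is ordinary data here,
# so the field is written unchanged.
buffer = io.StringIO()
writer = csv.writer(
    buffer, delimiter="\t", quoting=csv.QUOTE_NONE, lineterminator="\n"
)
writer.writerows(rows)
tsv_text = buffer.getvalue()

# Read it back with the same settings.
reader = csv.reader(io.StringIO(tsv_text), delimiter="\t", quoting=csv.QUOTE_NONE)
parsed = list(reader)

print(tsv_text)
print(parsed == rows)
```

With quoting disabled, the writer raises csv.Error rather than emitting a field that contains a tab, which matches the format's restriction.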
Technical Details
TSV files use a tab character (\t) as the column delimiter and a newline as the row delimiter. Unlike CSV, TSV does not require quoting: fields simply cannot contain literal tabs or newlines.
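The rules above are simple enough to sketch without a library. The following is a minimal illustration, not a reference implementation: the function names format_tsv and parse_tsv are hypothetical, and the code assumes rows are lists of strings and that lines end with "\n".

```python
def format_tsv(rows):
    """Serialize rows (lists of strings) to TSV text.

    Raises ValueError for values the format cannot represent.
    """
    lines = []
    for row in rows:
        for value in row:
            if "\t" in value or "\n" in value or "\r" in value:
                raise ValueError(f"field cannot contain a tab or newline: {value!r}")
        lines.append("\t".join(row))
    return "\n".join(lines) + "\n"


def parse_tsv(text):
    """Split TSV text into rows of fields. No quote handling is performed."""
    lines = text.split("\n")
    # A trailing newline leaves one empty string at the end; drop it.
    if lines and lines[-1] == "":
        lines.pop()
    return [line.split("\t") for line in lines]


# Hypothetical sample data for a round trip.
rows = [["id", "name"], ["1", "Ada, Countess"]]
text = format_tsv(rows)
print(parse_tsv(text) == rows)
```

Rejecting tabs and newlines at write time is one possible design choice. Some tools instead replace them with escape sequences such as \t and \n, but that convention is not part of the basic format described here.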
History
Tab-delimited files have been used since the earliest mainframe data processing. TSV was later formalized as the IANA media type text/tab-separated-values and is commonly used in bioinformatics (for example, BLAST tabular output), linguistics, and data warehousing.