{"id":63,"date":"2018-07-19T15:23:41","date_gmt":"2018-07-19T15:23:41","guid":{"rendered":"http:\/\/www.lib.uiowa.edu\/data\/organize-data\/file-formats-principles-for-selecting-file-formats\/"},"modified":"2026-02-16T23:25:01","modified_gmt":"2026-02-16T23:25:01","slug":"file-formats","status":"publish","type":"page","link":"http:\/\/www.lib.uiowa.edu\/data\/manage\/file-formats\/","title":{"rendered":"File Formats"},"content":{"rendered":"<p><a href=\"#rec\">Recommended file formats<\/a> | <a href=\"#more\">More information<\/a><\/p>\n<p>Selecting the optimal file format(s) for your data will help ensure that your data will be accessible for future use (your own, and for others). When selecting tools for your data, pay special attention to the output formats of your data.<\/p>\n<h2 id=\"rec\">Recommended file formats<\/h2>\n<p>The <a href=\"https:\/\/www.ukdataservice.ac.uk\/manage-data\/format\/recommended-formats\" target=\"_blank\" rel=\"noopener\">UK Data Service <i class=\"fas fa-external-link-alt\"> <\/i><\/a> has provided the following table of recommended formats.<\/p>\n<div class=\"c-message_attachment__row\"><span class=\"c-message_attachment__author\" data-qa=\"message_attachment_author\"><span class=\"c-message_attachment__part\"><span class=\"c-message_attachment__author_name\" data-qa=\"message_attachment_author_name\">Similarly, the <a href=\"https:\/\/www.loc.gov\/preservation\/resources\/rfs\/format-pref-summary.html\" target=\"_blank\" rel=\"noopener\">US Library of Congress <i class=\"fas fa-external-link-alt\"> <\/i><\/a> provides a table of preferred and acceptable formats <\/span><\/span><\/span><span class=\"c-message_attachment__title\"><span dir=\"auto\">for a variety of content types, including: <a href=\"https:\/\/www.loc.gov\/preservation\/resources\/rfs\/text.html\" target=\"_blank\" rel=\"noopener\">Textual Works <i class=\"fas fa-external-link-alt\"> <\/i><\/a> | <a 
href=\"https:\/\/www.loc.gov\/preservation\/resources\/rfs\/stillimg.html\" target=\"_blank\" rel=\"noopener\">Still Image Works <i class=\"fas fa-external-link-alt\"> <\/i><\/a> | <a href=\"https:\/\/www.loc.gov\/preservation\/resources\/rfs\/moving.html\" target=\"_blank\" rel=\"noopener\">Moving Image Works <i class=\"fas fa-external-link-alt\"> <\/i><\/a> | <a href=\"https:\/\/www.loc.gov\/preservation\/resources\/rfs\/audio.html\" target=\"_blank\" rel=\"noopener\">Audio Works <i class=\"fas fa-external-link-alt\"> <\/i><\/a> | <a href=\"https:\/\/www.loc.gov\/preservation\/resources\/rfs\/musical-scores.html\" target=\"_blank\" rel=\"noopener\">Musical Scores <i class=\"fas fa-external-link-alt\"> <\/i><\/a> | <a href=\"https:\/\/www.loc.gov\/preservation\/resources\/rfs\/data.html\" target=\"_blank\" rel=\"noopener\">Datasets <i class=\"fas fa-external-link-alt\"> <\/i><\/a> | <a href=\"https:\/\/www.loc.gov\/preservation\/resources\/rfs\/geo-carto.html\" target=\"_blank\" rel=\"noopener\">GIS, Geospatial and Non-GIS Cartographic <i class=\"fas fa-external-link-alt\"> <\/i><\/a> | <a href=\"https:\/\/www.loc.gov\/preservation\/resources\/rfs\/design3D.html\" target=\"_blank\" rel=\"noopener\">Design and 3D <i class=\"fas fa-external-link-alt\"> <\/i><\/a> | <a href=\"https:\/\/www.loc.gov\/preservation\/resources\/rfs\/software-videogames.html\" target=\"_blank\" rel=\"noopener\">Software and Video Games <i class=\"fas fa-external-link-alt\"> <\/i><\/a> | <a href=\"https:\/\/www.loc.gov\/preservation\/resources\/rfs\/webarchives.html\" target=\"_blank\" rel=\"noopener\">Web Archives <i class=\"fas fa-external-link-alt\"> <\/i><\/a> | <a href=\"https:\/\/www.loc.gov\/preservation\/resources\/rfs\/email.html\" target=\"_blank\" rel=\"noopener\">Email <i class=\"fas fa-external-link-alt\"> <\/i><\/a>.<\/span><\/span><\/div>\n<p><a href=\"http:\/\/www.lib.uiowa.edu\/data\/share\/data-repositories\/\" target=\"_blank\" rel=\"noopener\">Data repositories in your 
discipline<\/a> and other preservation and archiving groups may also have guidance or requirements for file formats.<\/p>\n<p>If the types of data and formats you work with are not listed in the table below or in the resources above, <a href=\"https:\/\/www.lib.uiowa.edu\/data\/contact\/\">contact us<\/a> for assistance.<\/p>\n<table style=\"width: 100%\" border=\"1\" cellspacing=\"0\" cellpadding=\"7\">\n<tbody>\n<tr class=\"tablesorter-headerRow\" role=\"row\">\n<th class=\"confluenceTh tablesorter-header sortableHeader tablesorter-headerUnSorted\" style=\"width: 278.167px\" role=\"columnheader\" scope=\"col\" data-column=\"0\" aria-label=\"Type of data: No sort applied, activate to apply an ascending sort\">\n<div class=\"tablesorter-header-inner\">Type of data<\/div>\n<\/th>\n<th class=\"confluenceTh tablesorter-header sortableHeader tablesorter-headerUnSorted\" style=\"width: 324.167px\" role=\"columnheader\" scope=\"col\" data-column=\"1\" aria-label=\"Recommended formats: No sort applied, activate to apply an ascending sort\">\n<div class=\"tablesorter-header-inner\">Recommended formats<\/div>\n<\/th>\n<th class=\"confluenceTh tablesorter-header sortableHeader tablesorter-headerUnSorted\" style=\"width: 291.167px\" role=\"columnheader\" scope=\"col\" data-column=\"2\" aria-label=\"Acceptable formats: No sort applied, activate to apply an ascending sort\">\n<div class=\"tablesorter-header-inner\">Other acceptable formats<\/div>\n<\/th>\n<\/tr>\n<\/tbody>\n<tbody>\n<tr>\n<td>Quantitative tabular data with extensive metadata.<\/p>\n<p>A dataset with variable labels, code labels, and defined missing values, in addition to the matrix of data.<\/td>\n<td>Proprietary formats of statistical packages, e.g., SPSS (.sav), Stata (.dta), SAS (.sas7bdat). Delimited text and command (\u2018setup\u2019) file (SPSS, Stata, SAS, etc.) 
containing metadata information.<\/p>\n<p>Some structured text or mark-up file containing metadata information, e.g., DDI XML file.<\/td>\n<td>SPSS portable format (.por).<\/p>\n<p>MS Access (.mdb\/.accdb).<\/td>\n<\/tr>\n<tr>\n<td>Quantitative tabular data with minimal metadata.<\/p>\n<p>A matrix of data with or without column headings or variable names, but no other metadata or labeling.<\/td>\n<td>Comma-separated values (CSV) file (.csv). Tab-delimited file (.tab).<\/p>\n<p>Including delimited text of given character set with SQL data definition statements where appropriate.<\/td>\n<td>Delimited text of given character set \u2013 only characters not present in the data may be used as delimiters (.txt).<\/p>\n<p>Widely-used formats: MS Excel (.xls\/.xlsx), MS Access (.mdb\/.accdb), OpenDocument Spreadsheet (.ods).<\/td>\n<\/tr>\n<tr role=\"row\">\n<td>Geospatial data.<\/p>\n<p>Vector and raster data.<\/td>\n<td>ESRI Shapefile (essential \u2013 .shp, .shx, .dbf, optional \u2013 .prj, .sbx, .sbn). Geo-referenced TIFF (.tif, .tfw).<\/p>\n<p>CAD data (.dwg).<\/p>\n<p>Tabular GIS attribute data.<\/td>\n<td>ESRI Geodatabase format (.mdb). MapInfo Interchange Format (.mif) for vector data.<\/p>\n<p>Keyhole Mark-up Language (.kml).<\/p>\n<p>Adobe Illustrator (.ai), CAD data (.dxf or .svg).<\/p>\n<p>Binary formats of GIS and CAD packages.<\/td>\n<\/tr>\n<tr role=\"row\">\n<td>Qualitative data.<\/p>\n<p>Textual.<\/td>\n<td>eXtensible Mark-up Language (XML) text according to an appropriate Document Type Definition (DTD) or schema (.xml). Rich Text Format (.rtf).<\/p>\n<p>Plain text data, ASCII (.txt).<\/td>\n<td>Hypertext Mark-up Language (.html). Widely-used formats: MS Word (.doc\/.docx).<\/p>\n<p>Some software-specific formats: NUD*IST, NVivo and ATLAS.ti.<\/td>\n<\/tr>\n<tr role=\"row\">\n<td>Digital image data.<\/td>\n<td>TIFF version 6 uncompressed (.tif). Digital Imaging and Communications in Medicine (DICOM) (.dcm, .dcm30) \u2013 for CT\/MRI data.<\/td>\n<td>JPEG (.jpeg, 
.jpg) but only if created in this format. TIFF (other versions) (.tif, .tiff).<\/p>\n<p>Adobe Portable Document Format (PDF\/A, PDF) (.pdf).<\/p>\n<p>Standard applicable RAW image format (.raw).<\/p>\n<p>Photoshop files (.psd).<\/p>\n<p>BMP (.bmp) but only if created in this format.<\/p>\n<p>PNG (.png) but only if created in this format.<\/td>\n<\/tr>\n<tr role=\"row\">\n<td>Digital audio data.<\/td>\n<td>Free Lossless Audio Codec (FLAC) (.flac).<\/td>\n<td>MPEG-1 Audio Layer 3 (.mp3) if the original was created in this format. Audio Interchange File Format (.aif).<\/p>\n<p>Waveform Audio Format (.wav).<\/td>\n<\/tr>\n<tr role=\"row\">\n<td>Digital video data.<\/td>\n<td>MPEG-4 (.mp4). OGG video (.ogv, .ogg).<\/p>\n<p>Motion JPEG 2000 (.mj2).<\/td>\n<td>MOV (.mov). Windows Media Video (WMV) (.wmv).<\/p>\n<p>WebM (.webm).<\/td>\n<\/tr>\n<tr role=\"row\">\n<td>Documentation and scripts.<\/td>\n<td>Rich Text Format (.rtf). PDF\/A or PDF (.pdf).<\/p>\n<p>HTML (.htm).<\/p>\n<p>OpenDocument Text (.odt).<\/p>\n<p>R Markdown files (.rmd) (with HTML version as well).<\/td>\n<td>Plain text (.txt). Widely-used proprietary formats: MS Word (.doc\/.docx), MS Excel (.xls\/.xlsx).<\/p>\n<p>XML marked-up text (.xml) according to an appropriate DTD or schema, e.g. XHTML 1.0.<\/td>\n<\/tr>\n<\/tbody>\n<\/table>\n<p><a href=\"https:\/\/ukdataservice.ac.uk\/learning-hub\/research-data-management\/format-your-data\/recommended-formats\/\" target=\"_blank\" rel=\"noopener\">Recommended Formats by the UK Data Service <i class=\"fas fa-external-link-alt\"> <\/i><\/a><\/p>\n<h2 id=\"more\">More about File Formats<\/h2>\n<h3>Use open, non-proprietary file formats<\/h3>\n<p>Open, non-proprietary formats are far more likely to remain usable even if the software that created them is not available or no longer functional. 
Formats whose documentation is complete and freely available also have a higher likelihood of long-term preservation.<\/p>\n<p>If the program that created the file is the only option for reading or accessing the data, the file is likely in a proprietary, non-open format. As a general rule, plain text formats, such as comma- or tab-delimited files, are open formats and are typically better for re-use and long-term preservation.<\/p>\n<p>Image file examples:<\/p>\n<table style=\"width: 34.3572%\" border=\"1\" cellspacing=\"0\" cellpadding=\"12\">\n<tbody>\n<tr>\n<td style=\"width: 100%\">proprietary format: .psd file<\/td>\n<\/tr>\n<tr>\n<td style=\"width: 100%\">open format: .tiff image file<\/td>\n<\/tr>\n<\/tbody>\n<\/table>\n<h3>Use &#8220;lossless&#8221; formats<\/h3>\n<p>Formats that compress the information in a file produce smaller files, but some compression methods permanently remove data from the file. Formats that discard data this way are &#8220;lossy,&#8221; while formats that lose no information when uncompressed are &#8220;lossless.&#8221;<\/p>\n<p>Audio and image file examples:<\/p>\n<table style=\"width: 38.9419%\" border=\"1\" cellspacing=\"0\" cellpadding=\"12\">\n<tbody>\n<tr>\n<td style=\"width: 354px\">lossy formats: .mp3 audio file, .jpeg image file<\/td>\n<\/tr>\n<tr>\n<td style=\"width: 354px\">lossless formats: .wav audio file, .tiff image file<\/td>\n<\/tr>\n<\/tbody>\n<\/table>\n<h3>Use unencrypted and uncompiled formats<\/h3>\n<p>If the encryption key, passphrase, or password to a file is lost, there may be no way to retrieve the data from the file later, rendering it unusable to you and to others.<\/p>\n<p>Uncompiled source code is more readily re-usable by others and has a far greater likelihood of remaining usable over time, since it can be recompiled on different architectures and platforms.<\/p>\n<h3>Want to dig deeper?<\/h3>\n<p>See the <a href=\"https:\/\/www.loc.gov\/preservation\/resources\/rfs\/index.html\" target=\"_blank\" rel=\"noopener\">US 
Library of Congress information about file formats for preservation <i class=\"fas fa-external-link-alt\"> <\/i><\/a>.<\/p>\n","protected":false},"excerpt":{"rendered":"<p>Recommended file formats | More information Selecting the optimal file format(s) for your data will help ensure that your data will be accessible for future use (your own, and for [&hellip;]<\/p>\n","protected":false},"author":1,"featured_media":0,"parent":471,"menu_order":0,"comment_status":"closed","ping_status":"closed","template":"pagetpl-data.php","meta":{"footnotes":"","_links_to":"","_links_to_target":""},"categories":[52],"tags":[89,95,101,102,63,103,104],"class_list":["post-63","page","type-page","status-publish","hentry","category-managing","tag-data-management","tag-data-planning","tag-file-formats","tag-formatting","tag-managing-data","tag-organizing-data","tag-research-data-planning","managing","","data","wp-json","wp","v2","pages","63"],"_links":{"self":[{"href":"http:\/\/www.lib.uiowa.edu\/data\/wp-json\/wp\/v2\/pages\/63","targetHints":{"allow":["GET"]}}],"collection":[{"href":"http:\/\/www.lib.uiowa.edu\/data\/wp-json\/wp\/v2\/pages"}],"about":[{"href":"http:\/\/www.lib.uiowa.edu\/data\/wp-json\/wp\/v2\/types\/page"}],"author":[{"embeddable":true,"href":"http:\/\/www.lib.uiowa.edu\/data\/wp-json\/wp\/v2\/users\/1"}],"replies":[{"embeddable":true,"href":"http:\/\/www.lib.uiowa.edu\/data\/wp-json\/wp\/v2\/comments?post=63"}],"version-history":[{"count":57,"href":"http:\/\/www.lib.uiowa.edu\/data\/wp-json\/wp\/v2\/pages\/63\/revisions"}],"predecessor-version":[{"id":3173,"href":"http:\/\/www.lib.uiowa.edu\/data\/wp-json\/wp\/v2\/pages\/63\/revisions\/3173"}],"up":[{"embeddable":true,"href":"http:\/\/www.lib.uiowa.edu\/data\/wp-json\/wp\/v2\/pages\/471"}],"wp:attachment":[{"href":"http:\/\/www.lib.uiowa.edu\/data\/wp-json\/wp\/v2\/media?parent=63"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"http:\/\/www.lib.uiowa.edu\/data\/wp-json\/wp\/v2\/categories?post=63"},{"tax
onomy":"post_tag","embeddable":true,"href":"http:\/\/www.lib.uiowa.edu\/data\/wp-json\/wp\/v2\/tags?post=63"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}