About a year ago, I worked on my home-project that has relation to image compression. I used my favorite framework - Qt - and things were going well.
Until I didn’t have to plot some statistics data. If I were used Python, I would just import matplotlib library, add several lines of code and - whoop - get neat diagrams! However I used C++ and Qt. In addition, I didn’t want to make this task too complicated. Therefore, I decided that the simplest way for me was to export my data to csv-file, open it with Microsoft Excel or LibreOffice Calc and create plots.
I don’t intend to describe here the last part of this task - creation of plots in the office suite. There are bunch of good articles and videos on this topic in the Internet. Instead I want to concentrate on the csv-stuff: how to read and write csv files using Qt.
Ok, let’s define our task in more detail. For example, we have a list of pixel objects and we want to create a histograms for each component of those pixels. Each pixel have three channels: red, green and blue. So this data could be represented as this table:
| red | green | blue | 
|---|---|---|
| 11 | 255 | 0 | 
| 167 | 209 | 89 | 
| 34 | 55 | 12 | 
| … | ….. | …. | 
How we can export such table to csv-file? First, let’s see what is a csv-file. csv-file is a text file with quite a simple structure. Each line in csv-file is file interpreted as a single row of a table. Elements (or cell values) in the each row are separated with separator symbol (comma or tab or any other separator, even “avb!@;;”). So our table in csv-file will looks like this (with comma as separator symbol):
red,green,blue
11,255,0
167,209,89
34,55,12
...
Good. Next question - how to create such file programmatically? No problem, all the necessary code you’ll find in this example project:
https://github.com/iamantony/csv-read-write-example
Here is the code from main.cpp file:
#include <QList>
#include <QColor>
#include <QString>
#include <QStringList>
#include <QFile>
#include <QTextStream>
#include <QDebug>
QList<QRgb> GetPixels()
{
    QList<QRgb> pixels;
    pixels << qRgb(11, 255, 0) << qRgb(167, 209, 89) << qRgb(34, 55, 12);
    return pixels;
}
QList<QStringList> PixelsToStrings(const QList<QRgb>& pixels)
{
    QList<QStringList> strings;
    for (int i = 0; i < pixels.size(); ++i)
    {
        QStringList values;
        values << QString::number(qRed(pixels.at(i))) <<
                  QString::number(qGreen(pixels.at(i))) <<
                  QString::number(qBlue(pixels.at(i)));
        strings << values;
    }
    return strings;
}
void WriteToCSV(const QList<QStringList>& pixels)
{
    // Open csv-file
    QFile file("pixels.csv");
    file.open(QIODevice::Append | QIODevice::Text);
    // Write data to file
    QTextStream stream(&file);
    QString separator(",");
    for (int i = 0; i < pixels.size(); ++i)
    {
        stream << pixels.at(i).join(separator) << endl;
    }
    stream.flush();
    file.close();
}
QList<QStringList> ReadCSV()
{
    // Open csv-file
    QFile file("pixels.csv");
    file.open(QIODevice::ReadOnly | QIODevice::Text);
    // Read data from file
    QTextStream stream(&file);
    QList<QStringList> data;
    QString separator(",");
    while (stream.atEnd() == false)
    {
        QString line = stream.readLine();
        data << line.split(separator);
    }
    file.close();
    return data;
}
void Print(const QList<QStringList>& data)
{
    for (int i = 0; i < data.size(); ++i)
    {
        qDebug() << data.at(i).join(", ");
    }
}
int main()
{
    QList<QRgb> pixels = GetPixels();
    QList<QStringList> pixelsStr = PixelsToStrings(pixels);
    WriteToCSV(pixelsStr);
    QList<QStringList> readData = ReadCSV();
    Print(readData);
    return 0;
}
Code, that create and write csv-file is located in function WriteToCSV(). It’s very short, as you see. If you compile this project and run it, in project folder (or in build folder) will be created file with name pixels.csv. You can open it with your favorite text editor, you will see the desired numbers.
Reading of the csv-file isn’t a difficult task (also). In function ReadCSV() we open csv-file, read it line by line and split these lines by separator symbol. As the output, we get list of lists of strings - elements of the table.
You can use csv-files as a method to export data from your program and as a method to import data into your program. Very useful!
Code that I posted here for reading and writing csv-files is Ok. But it have some drawbacks.
Use only one type of data - strings. I mean, functions use list of strings as output/input objects, so before using these functions you must transform your data (it could be integers, floats or other complex objects) to strings by yourself.
High memory consumption.
2.1. When you read the csv-file, all it’s content will be loaded to the memory. It’s Ok when file is small, but what if it has hundreds of thousands of lines? It’s highly possible that eventually you’ll run out of memory and your program will crush.
2.2. When you write to the csv-file all your data primarily have to be converted to strings. It is OK if your data contains only strings, so no conversion is needed. Otherwise, in memory you’ll have two identical copies of your data (original and stringified). After that when you stream your data to the file it will be at first saved to some string buffer, checked, converted and only after that will be written to the file.
To eliminate this drawbacks (well, most of them) I wrote small library - qtcsv. It has class Reader that can read csv-files. It has class Writer that can write csv-files. Also it has several container classes for data that is going to be readed from or written to csv-file. Let’s examine it in more detail.
qtcsv library have three container classes. AbstractData is a pure abstract class that provide interface for a concrete container classes StringData and VariantData. It has only basic functions for adding new rows, getting rows values, clearing all data and checking if container is empty or not.
Functions in AbstractData class are only declared, but not defined. AbstractData class don’t know how “raw” data is actually saved in container class. It is up to user to define such things in concrete classes.
First concrete container class in qtcsv library is StringData. From its name you can guess how this container hold data. Yes, in strings. Or to be more precise - in QList<QStringList>. StringData implements all abstract functions of AbstractData plus it has some container-specific functions like insertRow(), replaceRow() and operator«(). StringData works only with strings. It accepts new data only in strings, it returns data in strings and so on. So it is best to use StringData if your data is already represented in strings.
Second concrete container class in qtcsv library is VariantData. As StringData, it implements all abstract functions of AbstractData plus it has some container-specific functions. The main difference is that VariantData holds data in QVariant type (QList<QList<QVariant>>). In Qt QVariant is a very specific and useful type. It works like a wrapper for a many types. You can create QVariant variable from int, string, bool, double or Qt-specific classes like QDate, QByteArray and many others (see list of constructors of QVariant in Qt documentation). So if your data is a set of strings, ints and doubles, it is easier to use VariantData as a container for your data than StringData. Because when it will be time to write your content to file, VariantData automatically transform data to strings.
I see the use of VariantData as a solution to the first drawback of the traditional approach to working with csv-files. With VariantData you don’t need additional step of data transformation. In most cases. But if your data use specific type to store information, VariantData won’t help here because it don’t know how to transform your specific type to string. It will be your work.
So now we know how to store information in containers of qtcsv library. Next let’s see how to write it to csv-file using Writer class. Refer to Writer section in Readme file of qtcsv project.
Writer has only one function - write(). This function have many arguments, but most of them have default values. That amount of arguments is explained by desire to provide the most flexible way of working with csv-files. You can change parameters on the fly, no recompilation (kind of my solution to the third drawback).
Writer sends data to file by chunks. It collects several rows from original data, transform it to string (if necessary), adds separators, new line symbols and send to QTextStream which is then send it to QFile. This cycle repeats till there is no data left. This approach is very useful when you have big amount of data. Writer will not take much memory so your program will run smoothly. Also Writer provides solution to the 2.2 drawback. It will convert your data to strings only when it necessary.
Finally we get to the Reader. The purpose of this class is obvious - reading content of the csv-file. Refer to Reader section in Readme file of qtcsv project. Unfortunately Reader don’t provide solution to drawback 2.1. It reads the whole file at once and saves all its content to container.
And this is it. I reviewed all public classes of qtcsv library. I don’t think it is perfect - it is definitely not. qtcsv do not eliminate all the drawbacks I have mentioned. But it provides simple and easy-to-use interface to work with csv-files. Plus it could be extended to meet your needs.
I also created example project, that use qtcsv library. It’s like a quick-start or playground. I hope it will be useful for your.
https://github.com/iamantony/qtcsv-example.git