Title: | Terribly-Simple Data Base for Time Series |
---|---|
Description: | A terribly-simple data base for numeric time series, written purely in R, so no external database-software is needed. Series are stored in plain-text files (the most-portable and enduring file type) in CSV format. Timestamps are encoded using R's native numeric representation for 'Date'/'POSIXct', which makes them fast to parse, but keeps them accessible with other software. The package provides tools for saving and updating series in this standardised format, for retrieving and joining data, for summarising files and directories, and for coercing series from and to other data types (such as 'zoo' series). |
Authors: | Enrico Schumann [aut, cre] |
Maintainer: | Enrico Schumann <[email protected]> |
License: | GPL-3 |
Version: | 1.1-0 |
Built: | 2025-01-04 06:11:45 UTC |
Source: | https://github.com/enricoschumann/tsdb |
A terribly-simple data base for numeric time series,
written purely in R, so no external database-software is
needed. Series are stored in plain-text files (the
most-portable and enduring file type) in CSV
format; timestamps are encoded using R's native numeric
representation for
Date
/POSIXct
, which makes them
fast to parse, but keeps them accessible with other
software. The package provides tools for saving and
updating series in this standardised format, for
retrieving and joining data, for summarising files and
directories, and for coercing series from and to other
data types (such as 'zoo' series).
See the functions ts_table
and
as.ts_table
for creating a
ts_table
.
See write_ts_table
and
read_ts_tables
for storing and loading a
ts_table
(or several).
For getting started, see the tutorial at https://gitlab.com/enricoschumann/tsdb/blob/master/README.org or https://github.com/enricoschumann/tsdb/blob/master/README.org .
Enrico Schumann
ts_table
and as.ts_table
for
creating a ts_table
write_ts_table
and read_ts_tables
for
storing and loading a ts_table
Coerce objects to ts_table
as.ts_table(x, ...) ## S3 method for class 'zoo' as.ts_table(x, columns, ...)
as.ts_table(x, ...) ## S3 method for class 'zoo' as.ts_table(x, columns, ...)
x |
object to be coerced to ts_table |
columns |
character |
... |
arguments to be passed to other methods |
A generic function for coercing objects to class
ts_table
.
a ts_table
Enrico Schumann
library("zoo") as.ts_table(zoo(1:5, Sys.Date()-5:1), ## note that the "columns" columns = "value") ## must be specified
library("zoo") as.ts_table(zoo(1:5, Sys.Date()-5:1), ## note that the "columns" columns = "value") ## must be specified
Provides information about data stored in file: columns, number of observations, range of timestamps.
file_info(dir, file)
file_info(dir, file)
dir |
character |
file |
character |
Provide information, such as number of entries, of specified files.
It is recommended that code that uses the returned
information to alter or write tables, should explicitly
check whether a table exists (column exists
in
the returned data.frame
). For instance,
a value of NA
for min.timestamp
would occur for a non-existing file, but also if the
file could not be read for some reason.
An object of type file_info
, which is a
data.frame
with information such as whether a
file exists, minimum and maximum timestamp, and more.
Enrico Schumann
ts <- ts_table(1:3, as.Date("2018-12-3") + 1:3, columns = "A") d <- tempdir() write_ts_table(ts, file = "temp", dir = d) file_info(d, "temp")
ts <- ts_table(1:3, as.Date("2018-12-3") + 1:3, columns = "A") d <- tempdir() write_ts_table(ts, file = "temp", dir = d) file_info(d, "temp")
Read time-series data from files and merge them.
read_ts_tables(file, dir, t.type = "guess", start, end, columns, return.class = NULL, drop.weekends = FALSE, column.names = "%dir%/%file%::%column%", backend = "csv", read.fn = NULL, frequency = "1 sec", timestamp)
read_ts_tables(file, dir, t.type = "guess", start, end, columns, return.class = NULL, drop.weekends = FALSE, column.names = "%dir%/%file%::%column%", backend = "csv", read.fn = NULL, frequency = "1 sec", timestamp)
file |
character |
dir |
character |
t.type |
character: |
start |
a timestamp: either of classes It is best to always specify |
end |
a timestamp: either of classes It is best to always specify |
columns |
character. |
return.class |
|
drop.weekends |
logical |
column.names |
character: a format string for column names; may
contain |
backend |
character: currently, only ‘ |
read.fn |
|
frequency |
character; used compute a regular grid between
|
timestamp |
a vector of timestamps: if specified, only data at
the times in |
Read time-series data from CSV files.
When return.class
is NULL
, a list:
data |
a numeric matrix |
timestamp |
Date or POSIXct |
columns |
character |
file.path |
character |
Otherwise an object of class as specified by argument
return.class
.
Enrico Schumann
t1 <- ts_table(1:3, as.Date("2018-12-3") + 1:3, columns = "A") t2 <- ts_table(4:5, as.Date("2018-12-3") + 1:2, columns = "A") d <- tempdir() ## this is just an example. ## Actual (valuable) data should never ## be stored in a tempdir! write_ts_table(t1, dir = d, file = "t1") write_ts_table(t2, dir = d, file = "t2") read_ts_tables(c("t1", "t2"), dir = d, columns = "A", return.class = "zoo", column.names = "%file%.%column%")
t1 <- ts_table(1:3, as.Date("2018-12-3") + 1:3, columns = "A") t2 <- ts_table(4:5, as.Date("2018-12-3") + 1:2, columns = "A") d <- tempdir() ## this is just an example. ## Actual (valuable) data should never ## be stored in a tempdir! write_ts_table(t1, dir = d, file = "t1") write_ts_table(t2, dir = d, file = "t2") read_ts_tables(c("t1", "t2"), dir = d, columns = "A", return.class = "zoo", column.names = "%file%.%column%")
Create a ts_table
.
ts_table(data, timestamp, columns)
ts_table(data, timestamp, columns)
data |
numeric |
timestamp |
|
columns |
column names |
Create a time-series table (ts_table
). A
ts_table
is a numeric matrix, so there is
always a dim
attribute. For a ts_table
x
, you get the number of observations with
dim(x)[1L]
.
Attached to this matrix are several attributes:
a vector: the numeric representation of the timestamp
character: the class of the original
timestamp, either Date
or POSIXct
a character vector that provides the columns names
There may be other attributes as well, but these three are always present.
Timestamps must be of class Date
or
POSIXct
(POSIXlt
is
converted). A tzone
attribute is dropped.
A ts_table
is not meant as a time-series
class. For most computations (plotting, calculation
of statistics, etc.), the ts_table
must first
be coerced to zoo
, xts
, a
data.frame
or a similar data
structure. Methods that perform such coercions are
responsible for converting the numeric timestamp
vector to an actual timestamp. For this, they may use
the function ttime
(‘translate time’).
a ts_table
Enrico Schumann
ts_table(1:5, Sys.Date() - 5:1, columns = "value")
ts_table(1:5, Sys.Date() - 5:1, columns = "value")
Translate a vector of timestamps.
ttime(x, from = "datetime", to = "numeric", tz = "", strip.attr = TRUE, format = "%Y-%m-%d")
ttime(x, from = "datetime", to = "numeric", tz = "", strip.attr = TRUE, format = "%Y-%m-%d")
x |
numeric |
from |
character: |
to |
character: |
tz |
character |
strip.attr |
logical: strip attributes; in particular, timezone information |
format |
character |
ttime
(‘translate time’) converts timestamps
between formats.
Enrico Schumann
ttime(Sys.Date()) ttime(17397, from = "numeric", to = "Date")
ttime(Sys.Date()) ttime(17397, from = "numeric", to = "Date")
Write time-series data to files.
write_ts_table(ts, dir, file, add = FALSE, overwrite = FALSE, replace.file = FALSE, backend = "csv")
write_ts_table(ts, dir, file, add = FALSE, overwrite = FALSE, replace.file = FALSE, backend = "csv")
ts |
a |
dir |
character |
file |
character |
add |
logical: if |
overwrite |
logical: overwrite existing file when data
differs. |
replace.file |
logical: if |
backend |
a string; currently, only |
The function takes a ts_table
and writes it to
a file.
If the file already exists and both add
and
overwrite
are FALSE
(the default),
nothing is written.
When add
is TRUE
, the function checks
if ts
contains timestamps not yet in the file
and, if there are any, writes only those data.
When overwrite
is TRUE
, the function
merges all observations in the file with those in
ts
and writes the result back to the file. If
ts
contains timestamps that were already in the
file, the data in the file are overwritten. Note that
no data will be removed from the file: timestamps not
in ts
remain unchanged in the file.
Invisibly, the number of data rows written to a file.
Enrico Schumann
t1 <- ts_table(1:3, as.Date("2018-12-3") + 1:3, columns = "A") t2 <- ts_table(4:5, as.Date("2018-12-3") + 1:2, columns = "A") d <- tempdir() ## this is just an example. ## Actual (valuable) data should never ## be stored in a tempdir! write_ts_table(t1, dir = d, file = "t1") write_ts_table(t2, dir = d, file = "t2") read_ts_tables(c("t1", "t2"), dir = d, columns = "A", return.class = "zoo", column.names = "%file%.%column%")
t1 <- ts_table(1:3, as.Date("2018-12-3") + 1:3, columns = "A") t2 <- ts_table(4:5, as.Date("2018-12-3") + 1:2, columns = "A") d <- tempdir() ## this is just an example. ## Actual (valuable) data should never ## be stored in a tempdir! write_ts_table(t1, dir = d, file = "t1") write_ts_table(t2, dir = d, file = "t2") read_ts_tables(c("t1", "t2"), dir = d, columns = "A", return.class = "zoo", column.names = "%file%.%column%")