Support #66
Get backups running on gdp-0[1234]
Related issues
History
#1 Updated by Nitesh Mor over 6 years ago
- Status changed from New to In Progress
- Split the 4TB disk on gdp-0[1234] into two logical volumes each, one for own data and one for the backup of a remote log-server.
- Created a different user gdp-backup, with SSH keys setup for password-less login.
- Changed the umask setting for the log-server to be a little more relaxed, such that gdp-backup can read the data
Data transfer in progress. Looks like a few hours for gdp-01's 100+GB data files.
#2 Updated by Nitesh Mor over 6 years ago
- Related to Support #65: Upgrade gdp-01 to Ubuntu 16.04 added
#3 Updated by Nitesh Mor over 6 years ago
It turns out that naively doing an rsync
is not a very good strategy. Default behavior of rsync
is
- make a decision whether a file needs to be synced or not based on modification time and filesize.
- if so, then do a full sync based on a rolling checksum.
With such an approach, if all the logs get a single new record appended, this kind of rsync
will require reading the entire contents of the disk. However, with the knowledge that the .gdplog
files are only appended to, we can use the --append
flag safely. For .gdpndx
and .gdptidx
files, this assumption does not hold true necessarily. But we can always regenerate them from the data file, so we don't even need to do any backups for them.
#4 Updated by Eric Allman over 6 years ago
- Status changed from In Progress to Resolved
I believe backups are working now. Nitesh, can you confirm?
#5 Updated by Nitesh Mor over 6 years ago
- Status changed from Resolved to Closed
Confirmed that backups are working. Closing the issue.