Revision 14079
Added by Aaron Marcuse-Kubitza over 10 years ago
README.TXT | ||
---|---|---|
118 | 118 |
Note that import_all will take up to an hour to import the NCBI backbone |
119 | 119 |
and other metadata before returning control to the shell. |
120 | 120 |
To view progress: |
121 |
tail inputs/{.,}*/*/logs/$version.log.sql |
|
121 |
tail inputs/{.,}??*/*/logs/$version.log.sql
|
|
122 | 122 |
note: at the beginning of the import, the system may send out CPU load |
123 | 123 |
warning e-mails. these can safely be ignored. (they happen because the |
124 | 124 |
parallel imports use all the available cores.) |
... | ... | |
136 | 136 |
upload logs: make inputs/upload live=1 |
137 | 137 |
On local machine: make inputs/download-logs live=1 |
138 | 138 |
check for disk space errors: |
139 |
grep --files-with-matches -F 'No space left on device' inputs/{.,}*/*/logs/$version.log.sql |
|
139 |
grep --files-with-matches -F 'No space left on device' inputs/{.,}??*/*/logs/$version.log.sql
|
|
140 | 140 |
if there are any matches: |
141 | 141 |
manually reimport these datasources using the steps under |
142 | 142 |
Single datasource import |
143 | 143 |
bin/after_import & |
144 | 144 |
wait for the import to finish |
145 |
tail inputs/{.,}*/*/logs/$version.log.sql |
|
145 |
tail inputs/{.,}??*/*/logs/$version.log.sql
|
|
146 | 146 |
In the output, search for "Command exited with non-zero status" |
147 | 147 |
For inputs that have this, fix the associated bug(s) |
148 | 148 |
If many inputs have errors, discard the current (partial) import: |
... | ... | |
202 | 202 |
now in the copied tab instead |
203 | 203 |
Insert a copy of the leftmost "By column" column group before it |
204 | 204 |
export version=<version> |
205 |
bin/import_date inputs/{.,}*/*/logs/$version.log.sql |
|
205 |
bin/import_date inputs/{.,}??*/*/logs/$version.log.sql
|
|
206 | 206 |
Update the import date in the upper-right corner |
207 |
bin/import_times inputs/{.,}*/*/logs/$version.log.sql |
|
207 |
bin/import_times inputs/{.,}??*/*/logs/$version.log.sql
|
|
208 | 208 |
Paste the output over the # Rows/Time columns, making sure that the |
209 | 209 |
row counts match up with the previous import's row counts |
210 | 210 |
If the row counts do not match up, insert or reorder rows as needed |
211 | 211 |
until they do. Get the datasource names from the log file footers: |
212 |
tail inputs/{.,}*/*/logs/$version.log.sql |
|
212 |
tail inputs/{.,}??*/*/logs/$version.log.sql
|
|
213 | 213 |
Commit: svn ci -m 'inputs/import.stats.xls: updated import times' |
214 | 214 |
Running individual steps separately: |
215 | 215 |
To run TNRS: |
Also available in: Unified diff
bugfix: /README.TXT: logs glob: don't unintentionally match . or ..