Monday, December 17, 2007

Computer problems

Basically a computer problems day!

Over the weekend my machine crashed several times from some kind of "overload" while it was downloading, checksumming, unzipping and copying the large SDSS files to the external disk, "all at once". The best guess is a temperature problem ...

Anyway, that happened at the same time as I found out I couldn't have a file larger than 16 GB on the external disk. After googling I found the reason (a bit tricky, because the size that ls -lh showed was 17 GB: 16 GB counted in base "1024" is about 17 GB counted in base "1000"). Ext2 with a block size of 1K has a limit of 16 GB per file, which increases to 256 GB with a block size of 2K, and so on. So, reformatting the whole array was the solution.
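For the record, here is a minimal sketch of where those numbers come from. It assumes the classic ext2 inode layout (12 direct block pointers plus single, double and triple indirect blocks, with 4-byte pointers) and, as far as I understand it, a separate cap of 2^32 sectors of 512 bytes that takes over for larger block sizes. The function name is my own, not something from the ext2 tools.

```python
def ext2_max_file_size(block_size):
    """Approximate maximum file size in bytes on ext2 for a given block size.

    Assumes 12 direct pointers plus single, double and triple indirect
    blocks, each pointer being 4 bytes wide.
    """
    pointers_per_block = block_size // 4
    max_blocks = (12
                  + pointers_per_block
                  + pointers_per_block ** 2
                  + pointers_per_block ** 3)
    indirect_limit = max_blocks * block_size
    # Assumed cap: the inode counts 512-byte sectors in a 32-bit field.
    sector_limit = 2 ** 32 * 512
    return min(indirect_limit, sector_limit)


for bs in (1024, 2048, 4096):
    size = ext2_max_file_size(bs)
    # Same number of bytes, shown in base 1024 and in base 1000.
    print(f"block {bs}: {size / 1024**3:.1f} (base 1024) = "
          f"{size / 1000**3:.1f} (base 1000)")
```

With a 1K block the limit comes out slightly above 16 GB in base 1024, which is a bit over 17 GB in base 1000, so the 16 versus 17 confusion is just the two ways of counting.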

Then, after so many crashes, the file system required a manual check, which only the sysadmin can do, and who was only available in the afternoon. So, no desktop for almost the whole day. But whenever that happens my whole account stops working, and until today nobody knew why. The other sysadmin (the one who takes care of the servers) found the reason: an environment variable pointing to a local disk makes every action wait for a timeout. OK, variable redefined and things work.

Also, the file system is checked and the array now mounts with the correct file system and at the desired mount points. That should make the manipulation of data on the disks faster (I hope, checking it NOW). I'm also testing the temperature issue by putting a "real" fan blowing at the open CPU. Let's see.

On the science side, the preparation of two more clusters is under way. One is still running and the other one is ready, over in Innsbruck. The question is how to bring the 16 GB here.
