Qstar HSM Troubleshooting Guide
Qstar HSM Troubleshooting Guide
Trademark credits
Agfa and the Agfa rhombus are trademarks or registered trademarks of Agfa-Gevaert N.V., Belgium
or its affiliates. IMPAX, Connectivity Manager, Audit Manager, WEB1000, IDCV XERO, TalkStation,
Heartlab, and HeartStation are trademarks or registered trademarks of Agfa HealthCare N.V. or its
affiliates. All other trademarks are held by their respective owners and are used in an editorial fashion
with no intention of infringement.
Manufacturer’s responsibility
The manufacturer, installer, or importer will be responsible for the safety, reliability, and performance
of the equipment only if:
• Installation, modifications, adjustments, changes, or repairs are performed by suitably qualified
service personnel.
• The electrical installation of the site in which the equipment is used is according to an applicable
safety standard (UL, CSA, or IEC/CDE).
• The equipment is used according to the instructions provided in the operation manuals.
Thank you for taking the time to provide feedback. Your comments will be forwarded to the group
responsible for this product’s documentation.
If you have a request for changes in the product, please contact your product sales representative or
the product’s technical support channel.
1 Getting started 6
Requirements for troubleshooting QStar HSM.........................................................................6
Syslog and SCSI log files.........................................................................................................6
Troubleshooting strategies for QStar HSM..............................................................................7
Opening a ticket with QStar technical support.........................................................................7
Glossary.......................................................................................................................................48
Index............................................................................................................................................50
• Knowledge of the architecture of the second archive tier and its integration between QStar
HSM and IMPAX
QStar HSM uses log files to record system information and error messages that can assist with
troubleshooting.
Analyze the syslog in tandem with the SCSI log file to obtain a clear understanding of the environment.
The primary goal of troubleshooting QStar HSM is to restore the original configuration back online,
and where possible, recover data from a damaged media. Use the following strategy when
troubleshooting QStar HSM:
1. Identify the component not working.
2. Identify the error that is generated.
3. Identify the impact of the error (for example, volume not available, mirrors not in-sync, or
retrieve not working).
Tip:
Failure to consider and research all relevant error messages can obscure the current status
of the entire system, which can lead to additional errors or improper configurations. Adjust
the troubleshooting strategy depending on the defective component (for example, media,
drive, library, or software).
As per the support contract, Agfa technical support provides first line support to the customer. Agfa
initially investigates the technical problem and tries to solve it. If the problem cannot be resolved,
Required information
To open a support ticket, the Agfa Field Service Engineer (FSE) must provide the following
information:
Information to provide
Host ID
Commands output
Windows
• vlcmd lsset -va
Solaris
• vllsset -va
• vllsdev jb_name
• mmparam set_name
• jbstatus jb_name
Configuration issues include all problems generated by improper configuration or use of the QStar
HSM software.
All debugging and troubleshooting occurs by checking and analyzing the syslog and the SCSI log
files. For information about how to obtain these files for the different operating systems, see Syslog
and SCSI log files (refer to page 6) or refer to the QStar Administrator's Guide.
Issue
IMPAX cannot store data or retrieve studies. QStar HSM does not report any errors.
Details
IMPAX archives the data in the volume set. When the volume is not mounted, this configuration
issue can arise from the IMPAX side directly being unable to store the data. QStar HSM itself does
not report any errors since it is not an illegal configuration state.
Issue
The QStar HSM syslog reports a message in the console of the GUI stating
Bring online media
The following is an example of the syslog:
06/08/2011 15:20:59 sdf_migrator: archive: sdf_get_vh: failed to open volume 16
(flag 0, original 2, fallback 2, vl_status 0x0)
06/08/2011 15:20:59 sdf_migrator: archive: sdf_ev_page_fault: return with error:
sdf_fill_page_buf: tn_num = 484776, page_num = 0, vol = 16, position = 5015502:
read_buf open: sdf_get_vh: Medium in set is offline
Details
The system goes into covered filesystem error. You cannot archive, but jobs can still be retrieved.
The error state is shown in QStar Administration Interface dialog.
Solution
1. To import the media previously exported, navigate to QStar Administration Interface > Media
> Online Media.
2. After importing the media, to clear the error condition status, navigate to QStar Administration
Interface > Integral Volumes > Migration View.
3. Click Clear Error Condition State.
The archive restarts.
Issue
QStar HSM daemons do not start. Users cannot archive or retrieve jobs.
Solution
1. Ensure that the device is correctly visible by the operating system and correctly claimed
(Windows environments).
2. For Windows, from the Start > Programs menu, check the device manager and the QStar
device claiming utility. The device must be claimed and configured in the QStar HSM software
to allow startup of the QStar HSM solution.
Issue
The following error message appears in the QStar HSM syslog and in the console of the GUI (for
Windows environments) or vlconsole –l (for Solaris environments):
Please vladdtoset erased (blank) medium. No medium to initialize
The error appears when you must archive data and the media is not present. Without an import of
the requested media, archiving is not possible. Retrieving might fail as well.
Details
The syslog and the console specify the set that requested the media:
08/11/2010 17:19:42 mmserver: Error condition: Covered error
08/11/2010 17:21:58 primary_sdf_migrator: tape_prime: sdf_call_vl: volume 1 not
available
08/11/2010 17:21:58 primary_sdf_migrator: tape_prime: sdf_get_vh: failed open
volume 1 (flag 0, original 6, fallback 6, vl_status 1) error 0/0
08/11/2010 17:21:58 primary_sdf_migrator: tape_prime: no more volumes available
08/11/2010 17:21:58 SDF: set tape_prime, sequence_number 1: Please vladdtoset
erased (blank) medium.
08/11/2010 17:21:58 primary_sdf_migrator: tape_prime: Event passthru ADDVOL: <>:
Operator help required
08/11/2010 17:21:58 SDF: set tape_mirror, sequence_number 1: Please vladdtoset
erased (blank) medium.
08/11/2010 17:21:58 sdf_slave: sdf_sl_open: can't add/get volume 1: Operator help
required
08/11/2010 17:21:58 primary_sdf_migrator: tape_prime: sdf_mir_dev_open: MIRR_open:
sdf_get_vh: can't open volume 1: Operator help required
Solution
1. Import the requested media and add the media to the set.
Note:
If the system is still in error, importing does not solve the issue. In this case, the system
remains in Covered Error mode, and neither archive nor retrieve are possible. Follow
the instructions in Troubleshooting: Current writing media offline (manually exported)
(refer to page 10).
Issue
The following message appears in the QStar syslog:
No space left on device
The error appears when you need to archive data and the media is not present. Without importing
the requested media, archiving is not possible. Retrieving might fail as well.
Details
The following is an example of the syslog:
08/11/2010 17:21:58 primary_sdf_migrator: tape_prime: no more volumes available
08/11/2010 17:21:58 primary_sdf_migrator: tape_prime: Event passthru ADDVOL: <>:
No space left on device
08/11/2010 17:21:58 primary_sdf_migrator: tape_prime: sdf_get_vh: secondary failed
08/11/2010 17:21:58 primary_sdf_migrator: tape_prime: sdf_get_vh: failed open
volume 1 (flag 0, original 6, fallback 0, vl_status 13) error 3000009/4000003
08/11/2010 17:21:58 primary_sdf_migrator: tape_prime: no more volumes available
08/11/2010 17:21:58 sdf_slave: got new volume, sending RESUME: No space left on
device
08/11/2010 17:21:58 primary_sdf_migrator: tape_prime: Event passthru MIRR_RESUME:
czc9161cg1:0 (volume 2): No space left on device
08/11/2010 17:21:58 primary_sdf_migrator: tape_prime: sdf_get_vh: secondary failed
08/11/2010 17:21:58 primary_sdf_migrator: tape_prime: sdf_get_vh: failed open
volume 1 (flag 0, original 6, fallback 0, vl_status 13) error 3000009/3000009
08/11/2010 17:21:58 primary_sdf_migrator: tape_prime: cfs_migrator_sdf:
sdf_start_vol: sdf_get_vh: No space left on device
08/11/2010 17:21:58 primary_sdf_migrator: tape_prime: No space left on device
08/11/2010 17:21:58 mmserver: S:\QStarCache: Covered filesystem error: No space
left on device: WRITE: Client UID=0, Client GID=0
Solution
1. Import the requested media and add the media to the set.
If the system is still in error, importing does not solve the issue. In this case, the system
remains in Covered Error mode, and neither archive nor retrieve are possible. Follow
the instructions in Troubleshooting: Current writing media offline (manually exported)
(refer to page 10).
2. After the media has been imported and added to the set, to clear the error condition status,
navigate to QStar Administration Interface > Integral Volumes > Migration View.
3. Click Clear Error Condition State.
The archive restarts.
Issue
No progress in archiving (backlog in the queue) or failure with retrievals.
Details
The library is typically equipped with two drives. This allows it to write two media at the same time
to maintain the mirror status. If a read request (retrieve) occurs during write operations, the write
is stopped, the request is satisfied, and the archive restarts. The library and the tape require time to
perform mechanical operations (like rewinding the tape’s ribbon, unloading the media, loading the
media, and seeking the position) which, can result in a timeout. Similarly, if a resync operation is
in progress, you cannot archive or retrieve data at the same time.
The following screenshot of the QStar HSM GUI shows the mirror status:
Solution
Note:
This is a design issue. Integration knowledge of the architecture of the second archive tier
and its interaction between QStar HSM and IMPAX is essential.
• Size the cache according to the daily customer load. If the cache is too small, the system starts
the archive before the scheduled time and creates an overlap if the retrieve occurs at the same
time.
Issue
Cannot mount the volume set, archive data, or retrieve data.
Depending on the QStar HSM version, several messages appear in the syslog file describing storage
space issues with the cache:
08/24/2010 04:06:12 INFO MM (tape_prime, H:\Qstarcache): H:\Qstarcache: OS reported
available space is 1473 MB, this is less than 1473 MB required by QStar
08/24/2010 04:06:12 INFO MM (tape_prime, H:\Qstarcache):
q_mm_check_cache_files_limit.1866: H:\Qstarcache: cache area full
08/24/2010 04:06:12 INFO MM (tape_prime, H:\Qstarcache): please free space in
cache area or add new cache area
08/24/2010 04:06:12 ERROR MM (tape_prime, H:\Qstarcache): q_mm_sync_strategy.2747:
cache files limit check failed, error condition is set
08/24/2010 08:19:03 INFO MM (tape_prime, H:\Qstarcache): Primary cache: 1475 MB
free
08/24/2010 08:19:03 INFO MM (tape_prime, H:\Qstarcache): error condition cleared
Details
Important!
Data loss can result if the database becomes corrupted.
The cache files change dynamically according to the activity of the volume set. The cache directory
contains the database files, which typically grow with the file system structure.
Solution
1. Enlarge the partition where the cache files are located.
Do not configure the cache size in QStar HSM as the maximum amount of space of the partition
(D:\cache) where it is placed.
If upgrading QStar 4.2.x to 4.3.x or 4.4.x (Solaris) (refer to page 42), the cache’s partition
contains files that grow dynamically, like transaction logs and database files. For this reason,
Agfa suggests leaving at least 10% of the disk space free.
2. If no primary data is present in the cache, unmount and resize the cache, then remount it.
Issue
No access to the volume set. Disconnection of the network drive of the QStar volume.
The services are stopped while an access to the media for retrieve or during rollover is made. If the
device is performing, certain operations are not allowed to unmount or stop the services. For instance,
during rollover, QStar HSM copies the database at the beginning of the media and the stop/unmount
might lead to a wrong or partial media initialization.
Important!
Unmounting or stopping of services during rollover or write/read from the volume is not allowed.
Details
This scenario does not generate a specific error in the syslog.
08/25/2011 10:11:43 qserver: VL: Set is mounted
08/25/2011 10:11:43 vlserver: vl shut server: tape_mirror: Set is mounted
08/25/2011 10:11:43 qserver: VL: retrying ...
08/25/2011 10:11:46 VL: vl shut server: tape_mirror: Set is mounted
08/25/2011 10:11:46 Last message repeated 2 times
08/25/2011 10:11:46 qserver: VL: Set is mounted
08/25/2011 10:11:46 qserver: VL: retrying ...
08/25/2011 10:11:49 VL: vl shut server: tape_mirror: Set is mounted
08/25/2011 10:11:49 qserver: VL: Set is mounted
08/25/2011 10:11:49 vlserver: vl shut server: tape_mirror: Set is mounted
08/25/2011 10:11:49 qserver: VL: retrying ...
08/25/2011 10:11:52 VL: vl shut server: tape_mirror: Set is mounted
08/25/2011 10:11:52 Last message repeated 2 times
08/25/2011 10:11:52 qserver: VL: Set is mounted
Solution
If necessary, contact QStar technical support. For information, see Opening a ticket with QStar
technical support (refer to page 7)
The SMB timeout is the amount of time that Windows waits to access a network resource. The QStar
HSM volume share is available through a network share, which can be affected by the SMB timeout.
For example, QStar requests a file present on a media that is not already in the drive. In this case,
the jukebox might take longer than the SMB timeout to move the media from the shelf to the drive
to satisfy the read request. As a result, Windows disconnects the share and the retrieve fails.
The SMB timeout can be changed through Windows registry keys. QStar HSM has implemented a
utility, setsmbtimeout.exe, is available in c:\qstar\bin.
Note:
Agfa recommends setting the SMB timeout to at least 300 seconds. You must restart the
system after modifying the SMB timeout value.
The MCFS (Magnetic Cache File System) Migration Manager handles the migration of the data and
the files from the media to the cache in two modes:
• Segmented mode—A file is viewed as a collection of data segments of discrete size. Each segment
can be migrated individually or collectively with other segments.
• Full file mode—The Migration Manager always migrates the complete file as a single entity.
For more information about the Migration Manager, refer to Chapter 6 – Migration Manager in
the QStar Administrator's Guide.
To significantly improve tape read performance, change the file migration mode of the files from
segmented mode (default) to full file mode. This forces QStar HSM to bring the full file back to the
cache with one request.
Note:
The time required to complete this procedure depends on whether you are applying it to a
new file system with little or no data, or whether it is an existing file system containing many
files.
To improve read performance for tape using full file mode (Windows)
1. Ensure that no archiving or read operations are in progress.
2. To change all file systems to full file mode, do the following:
a. Open a command prompt.
b. Type
C:\QStar\bin>mmcmd chmod +f integral_volume_name
3. To change one directory at a time to full file mode, do the following:
a. Open a command prompt.
b. To configure the first subdirectory, type
C:\QStar\bin>mmcmd chmod +f integral_volume_name\archive\sub_dir_name
c. After configuring all subdirectories, to configure the archive directory, type
C:\QStar\bin>mmcmd chmod +f primary_tape\archive
d. To configure the root directory, type
C:\QStar\bin>mmcmd chmod +f primary_tape
The Maximum Transfer Size value is displayed in the Jukebox Configuration tab of the QStar
Administration interface. After modifying the Maximum Transfer Size value in the registry, this tab
displays the current value enabling you to verify that the value increased after modifying the registry
entry.
The SCSI Adapter Maximum Transfer Size is the largest single SCSI data transfer that can be issued.
This does not mean the largest file size but rather the largest block that can be read or written to an
optical device with a single SCSI command. When small files must be transferred by issuing multiple
SCSI commands, the impact on read performance can be significant. However, write performance
suffers when this value is too small compared to the file size. Windows SCSI Adapters default allows
a maximum SCSI bus transfer size of 64K (65536 decimal bytes). Increasing the default SCSI adapter
transfer size significantly improves large file transfer performance. Agfa recommends setting the
maximum transfer size to 512 KB or 768 KB.
Recreate the cache for the integral volume set with 32 Write performance is less than optimal.
KB pages size (the default size is 64 KB)
Increase the Maximum Transfer Size of the SCSI Write performance improves.
adapter
To export the volume from Windows only using NFS, create an empty file called nosmb (without
a file extension) in the C:\qstar\install folder.
Issue
With some tape libraries associated with a SAS card or a SCSI card, you cannot attach the library in
QStar HSM. After configuring the library, the system returns the following error message:
SCSI transfer rate less than 129K not enough core
Details
This error means that this configuration must have a SCSI transfer rate greater than 129 KB.
Solution
To avoid this problem, starting from release 5.0.16.908 and later, force the system to work with a
lower transfer rate by placing an empty file (without an extension) called no129k in the C:\qstar\install
folder.
Note:
If the transfer rate displayed in the Jukebox Configuration page is less than 65 KB, even with
this workaround, the library cannot be configured.
When a small amount of memory is dedicated to applications, it forces the applications to use a
large amount virtual memory. This generates a high number of page faults that dramatically reduce
write performance. Changing the memory use priority to programs solves this performance problem
by allowing the applications priority to use real memory instead of virtual memory.
Important!
Technical issues that involve damaged media must be addressed by QStar or managed under
QStar supervision.
Check the syslog and SCSI log for errors involving termination, cabling, or adapter failure. These
logs report the media or drive involved and can provide corrective actions to take. These issues must
first be managed by Agfa support. Escalate to QStar support if additional information is required.
Issue
The following error appears in the syslog, typically during a retrieve from IMPAX or during archive:
07/27/2009 21:26:50 sdf_slave: sdf_sl_read: tape_locate: C:\QStar\dev\JB\rjb29a:
Position setup error: Error doing locate to 3780534: scsi locate: I/O error, sense
key (3 14 00): MEDIUM ERROR: RECORDED ENTITY NOT FOUND
07/27/2009 21:30:21 sdf_slave: sdf_sl_read: tape_locate: C:\QStar\dev\JB\rjb29a:
Position setup error: Error doing locate to 3780536: scsi locate: I/O error, sense
key (3 14 00): MEDIUM ERROR: RECORDED ENTITY NOT FOUND
07/27/2009 21:33:53 sdf_slave: sdf_sl_read: tape_locate: C:\QStar\dev\JB\rjb29a:
Position setup error: Error doing locate to 3780536: scsi locate: I/O error, sense
key (3 14 00): MEDIUM ERROR: RECORDED ENTITY NOT FOUND
07/27/2009 21:37:36 sdf_slave: sdf_sl_read: tape_locate: C:\QStar\dev\JB\rjb29a:
Position setup error: Error doing locate to 3780536: scsi locate: I/O error, sense
key (3 14 00): MEDIUM ERROR: RECORDED ENTITY NOT FOUND
07/27/2009 21:41:07 sdf_slave: sdf_sl_read: tape_locate: C:\QStar\dev\JB\rjb29a:
Position setup error: Error doing locate to 3780536: scsi locate: I/O error, sense
key (3 14 00): MEDIUM ERROR: RECORDED ENTITY NOT FOUND
Details
Tape media read failure. The media is not readable and its content cannot be retrieved.
Note:
Data loss is possible if the media was not synchronized with the mirror.
Solution
1. If the mirror is available (and was correctly written), resynchronize the media.
or
Run the copy tape to recover its contents to the damaged block.
2. If necessary, escalate to QStar technical support.
For information, see Opening a ticket with QStar technical support (refer to page 7).
Issue
The following error appears in the syslog, typically during archive or synchronization operations:
06/14/2011 12:36:28 sdf_do_hotsync: Current Hot Sync status: main(Primary) at:
12/247966, Primary at: 12/247966, Secondary at: 11/9992692, Primary not in sync
06/14/2011 12:36:28 primary_sdf_migrator: tape_prime: Finding end of primary
volume 06/14/2011 12:36:28 primary_sdf_migrator: tape_prime: searching hw mark
on primary (main position - 12:247966, primary position - 11:9992678)
06/14/2011 12:38:14 primary_sdf_migrator: tape_prime: reported hw mark is above
expected (volume 11, position 9992694)
06/14/2011 12:38:14 primary_sdf_migrator: tape_prime: found hw mark at volume 11,
position 9992694
06/14/2011 12:38:14 sdf_do_hotsync: Current Hot Sync status: main(Primary) at:
12/247966, Primary at: 12/247966, Secondary at: 11/9992694, Primary not in sync
06/14/2011 12:43:10 primary_sdf_migrator: tape_prime: sdf_mir_verify_hw: Hot sync
failed at volume 11 position 9992694: verification failed at volume 11 position
9992692: Media are different
06/14/2011 12:43:10 sdf_do_hotsync: Current Hot Sync status: main(Primary) at:
12/247966, Primary at: 12/247966, Secondary at: 11/9992694, Primary not in sync
06/14/2011 12:43:10 mirroring: F:\Qstarcache: Hot sync incomplete: volume 11,
position 9992694: Media are different
06/14/2011 12:48:09 sdf_slave: WRITE[C:\QStar\dev\JB0\rjb26a]: pos=247966,
bk_flags=0x0: TAPE_write: C:\QStar\dev\JB0\rjb26a: Error writing data: scsi write:
I/O error, sense key (b 47 03): ABORTED COMMAND: INFORMATION UNIT CRC ERROR
06/14/2011 12:48:09 primary_sdf_migrator: tape_prime: sdf_send_buf: volume 12,
position 0: MIRR_write (secondary): I/O error
06/14/2011 12:50:47 primary_sdf_migrator: tape_prime: check_written_bufs: buffer
read from position 247966 differs from source ... 0 bufs are OK
This following message is reported in the QStar SCSI log output:
09:51:20.9750 0.0000 JB0:drive1 write_6 0a 00 01 04 00 00 ABORTED COMMAND:
INFORMATION UNIT CRC ERROR=(0b 47 03 Additional: 00 00 00 00 00 00)
Details
This issue is caused by a library problem, bad cabling, or a bad termination.
No data can be archived from the logical volume to the tape. The tape is out-of-sync with the media
if errors appear during a hot sync operation. Check for out-of-sync status:
Solution
1. Check with the hardware vendor to determine if the volumes are out-of-sync.
2. If necessary, escalate to QStar technical support.
For information, see Opening a ticket with QStar technical support (refer to page 7).
Issue
The following error appears in the syslog, typically during a refresh or read operation:
06/07/2011 09:48:14 sdf_slave: sdf_sl_hw_search: C:\QStar\dev\JB0\rjb20a:
qs_scsi_open: find_class: inquiry: No such device
06/07/2011 09:48:14 primary_sdf_migrator: hw mark on mirror search failed at
volume 43: No such device
06/07/2011 09:48:14 primary_sdf_migrator: tape_prime: hw mark on mirror search
failed at volume 43: Hot sync failed at volume 43 position 7934564: No such device
06/07/2011 09:48:14 sdf_do_hotsync: Current Hot Sync status: main(Primary) at:
43/7934612, Primary at: 43/7934612, Secondary at: 43/7934564, Secondary not in
sync
06/07/2011 09:48:14 mirroring: G:\hsm\archive: Hot sync incomplete: volume 43,
position 7934564: No such device
06/07/2011 10:03:13 JB: standard_load wait ready: JB0 Drive 1 marked bad:
C:\QStar\dev\JB0\rdrive1: wait unit ready: scsi test unit ready: Device not ready,
sense key (2 3a 04): NOT READY: MEDIUM NOT PRESENT, MAM ACCESSIBLE
Details
The chip on the media (available from LTO3 generation) is damaged. This is the chip in the cartridge
where format information is stored. If chip content is destroyed, it may restore itself. If it is not
present (and most likely is damaged), the media is unreadable.
Note:
Data loss can result if the media was not synchronized with the mirror.
Solution
1. Read the tape using the vlcopy command.
Issue
The media is locked in the drive. Unloading the media fails and the media appears jammed in the
drive.
Note:
Details
The following error appears in the syslog:
07/30/2010 06:42:03 JB: standard_unload: JB0 Shelf 291 marked bad:
d:\qstar\dev\JB0\jbctl: scsi move:
I/O error, sense key (4 53 00): HARDWARE ERROR: MEDIA LOAD OR EJECT FAILED
07/30/2010 06:42:03 JB: standard_unload: JB0 Drive 4 marked bad: JB0 Shelf 291
marked bad: d:\qstar\dev\JB0\jbctl: scsi move: I/O error, sense key (4 53 00):
HARDWARE ERROR: MEDIA LOAD OR EJECT FAILED
07/30/2010 06:42:03 JB: q_jb_init_jukebox: JB0 Drive 4 marked bad: JB0 Shelf 291
marked bad: d:\qstar\dev\JB0\jbctl: scsi move: I/O error, sense key (4 53 00):
HARDWARE ERROR: MEDIA LOAD OR EJECT FAILED
The following messages appear in the SCSI log:
11:36:14 694 694 1:2:0 test unit ready 00 00 00 00 00 00 NOT READY: UNIT IS IN
PROCESS OF BECOMING READY=(02 04 01)
11:36:15 694 694 1:2:0 test unit ready 00 00 00 00 00 00 NOT READY: UNIT IS IN
PROCESS OF BECOMING READY=(02 04 01)
11:36:16 694 694 1:2:0 test unit ready 00 00 00 00 00 00 UNIT ATTENTION: NOT
READY, MEDIUM MAY HAVE CHANGED=(06 28 00)
11:36:16 694 694 1:2:0 test unit ready 00 00 00 00 00 00
11:36:16 694 694 1:2:0 test unit ready 00 00 00 00 00 00
11:36:16 694 1 803 1:2:0 start 1b 00 00 00 01 00 MEDIUM ERROR: MEDIA LOAD OR EJECT
FAILED=(03 53 00)
Issue
QStar HSM tries to access one of the resources (drive, media, or picker) but no answer is received
from the jukebox. The device is not responding.
The following error appears in the syslog:
09/20/2010 16:15:32 JB: standard_load wait ready: JB0 Drive 1 marked bad:
C:\QStar\dev\JB0\rdrive1: wait unit ready: scsi test unit ready: Device not ready,
sense key (4 44 00): HARDWARE ERROR: INTERNAL TARGET FAILURE
09/20/2010 16:15:32 JB: JB0, Drive 1: C:\QStar\dev\JB0\rdrive1: Sync failed: scsi
sync cache: I/O error, sense key (4 44 00): HARDWARE ERROR: INTERNAL TARGET FAILURE
09/20/2010 16:15:32 JB: Sync failure: JB0 Shelf 1 marked bad: JB0, Drive 1:
C:\QStar\dev\JB0\rdrive1: Sync failed: scsi sync cache: I/O error, sense key (4
44 00): HARDWARE ERROR: INTERNAL TARGET FAILURE
Details
No data loss or corruption is expected, but you must carefully manage the system until the issue is
resolved.
Solution
1. Reboot the library.
2. Check with hardware support to obtain more information on the system's status.
Note:
Although QStar HSM will go into error mode, this issue is caused by a prior hardware problem.
Issue
The following error appears in the syslog:
07/16/2010 08:17:32 sdf_slave: sdf_sl_write: TAPE_write: /opt/QStar/dev/jb/rjb130a:
USCSICMD ioctl:
Error writing data: scsi write:
SCSI cmd 0xa (write_6): I/O error, sense key (b 4b 00): ABORTED COMMAND: DATA
PHASE ERROR
07/16/2010 08:17:32 primary_sdf_migrator: tape_prime: hotsync: volume 163, position
5119654: MIRR_write (secondary): write: I/O error
07/16/2010 08:17:34 sdf_slave: sdf_sl_sync: TAPE_sync: tape_check_pos: Error
flushing drive buffer: First soft position mismatch: Invalid argument
07/16/2010 08:18:46 primary_sdf_migrator: tape_prime: secondary (set tape_mirror)
- jb 130a) position 5119654 (estimated space used 159989MB, capacity - 190734MB).
07/16/2010 08:18:46 primary_sdf_migrator: tape_prime: Please fix the problem and
restart hotsync orvlreplace medium (vlreplace jb 130).
07/16/2010 08:18:46 primary_sdf_migrator: tape_prime: sdf_hotsync: I/O error
Note:
The system might go out-of-sync. If the problem occurs in an already out-of-sync scenario,
the archive might also fail.
Details
If the system goes out-of-sync, the status is indicated in QStar Administration Interface > Integral
Volumes > Migration View > Mirror Media Status.
Note:
Solution
1. Check the drive with a hardware technician.
The system automatically puts the mirror out-of-sync and continues writing on the primary
drive.
2. Unmount or restart QStar HSM services.
This might restart the correct operation.
3. If necessary, escalate to QStar technical support.
For information, see Opening a ticket with QStar technical support (refer to page 7).
Issue
An overwrite in the database is attempted, but fails. The system goes into covered file system error.
You cannot archive and retrieve.
Details
The position reported by the database (where to write) is incorrect. QStar HSM recognizes that the
block already contains data and stops to avoid data corruption. The system goes out-of-sync; the
status is shown in QStar Administration Interface > Integral Volumes > Migration View > Archive
Status.
Solution
1. Obtain the full syslog file.
2. Escalate to QStar technical support.
For information, see Opening a ticket with QStar technical support (refer to page 7).
Issue
The system has a bad drive configuration file. Services do not start and, therefore, the volume cannot
be mounted. As a result, you cannot retrieve or archive.
Details
The hardware configuration has been changed. The configuration settings in QStar HSM are not
physically returned by the device. This can be caused by the following:
• A hardware issue such as a drive not working
Issue
A resource is temporarily unavailable. This can result from any of the following conditions:
• Bad status of the hardware marked in the QStar HSM database
• A hardware error marked the media bad (in QStar Administration Interface > Devices >
Jukebox Status)
Details
Archive or retrieve might not be possible, depending on the required media. If the media needed is
the current writing surface and the volume is not mounted, the QStar HSM volume cannot be
mounted.
The following error message appears in the syslog:
02/04/2011 22:50:15 primary_sdf_migrator: CORA_PRIME: Initiating backup based on
written data size
02/04/2011 22:50:17 primary_sdf_migrator: CORA_PRIME: cfs_migrator_sdf: Event 7:
raising error_status: Resource temporarily unavailable
02/04/2011 22:50:17 primary_sdf_migrator: CORA_PRIME: Resource temporarily
unavailable
02/04/2011 22:50:17 primary_sdf_migrator: CORA_PRIME: cfs_migrator_sdf: event 25,
error_status pending: Resource temporarily unavailable 02/04/2011 22:50:17
primary_sdf_migrator: CORA_PRIME: Resource temporarily unavailable
02/04/2011 22:50:17 ERROR MM (CORA_PRIME, E:\Qstarcache)-0:
q_mm_get_migrator_response.1761: Error from migrator: --> WRITE
\archive\20110202\035ea84c.blob, mode(0100455), ctime: Resource temporarily
unavailable
02/04/2011 22:50:17 ERROR MM (CORA_PRIME, E:\Qstarcache):
q_mm_get_migrator_response.1774: Migrator event WRITE failed, error condition is
set
Solution
1. Retrieve the syslog.
Issue
No archive, retrieve, or hot sync is possible.
The following error appears in the syslog:
04/01/2009 14:49:19 JB: JB0 Shelf 2 marked bad: The request could not be performed
because of an I/O device error., SCSI driver error
04/01/2009 14:49:19 JB: JB0 Drive 1 marked bad: JB0 Shelf 2 marked bad: The request
could not be performed because of an I/O device error., SCSI driver error
04/01/2009 14:50:47 JB: JB0 Shelf 32 marked bad: The request could not be performed
because of an I/O device error., SCSI driver error
04/01/2009 14:50:47 JB: JB0 Drive 2 marked bad: JB0 Shelf 32 marked bad: The
request could not be performed because of an I/O device error., SCSI driver error
04/01/2009 14:51:46 JB: JB0 Shelf 24 marked bad: I/O error, sense key 6 UNIT
ATTENTION 29 00 POWER ON, RESET, OR BUS DEVICE RESET OCCURRED
04/01/2009 14:51:46 JB: JB0 Drive 3 marked bad: JB0 Shelf 24 marked bad: I/O
error, sense key 6 UNIT ATTENTION 29 00 POWER ON, RESET, OR BUS DEVICE RESET
OCCURRED
04/01/2009 14:51:46 JB: JB0 Shelf 29 marked bad: I/O error, sense key 2 NOT READY
04 82 NOT READY, MOTOR TURNED OFF
04/01/2009 14:51:46 JB: JB0 Drive 4 marked bad: JB0 Shelf 29 marked bad: I/O
error, sense key 2 NOT READY 04 82 NOT READY, MOTOR TURNED OFF
Details
Following a hardware error (or any other abnormal situation), the device tries to reset itself by issuing
a Bus Reset command. This command interrupts and stops all the current operations in place by
the system (for example, read or archive).
The Bus Reset command is issued by the device, not by QStar HSM.
Solution
1. Escalate to QStar technical support so that they can check the status of the system.
For information, see Opening a ticket with QStar technical support (refer to page 7).
2. Contact hardware support so that they can identify why the jukebox is issuing the Bus Reset
command.
Issue
The system experiences a hardware error stating internal target failure. Archive or retrieve may not
be possible.
Details
The error indicates that the target of the operation is not available or is in an error state. The target
might be the media or the drive needed for the required operation.
Tip:
If the error occurs while reading media, the media might be corrupt. For more information,
see Troubleshooting: Recorded entity not found (refer to page 26).
Solution
1. Escalate to QStar technical support so that they can check the status of the system.
For information, see Opening a ticket with QStar technical support (refer to page 7).
2. If necessary, contact hardware support
Issue
A request cannot be performed because of I/O device error or SCSI drive error. Archiving or retrieving
might not be possible.
Details
The following error appears in the syslog:
12/20/2010 11:10:33 mirroring: F:\QStarcache: Hot sync incomplete: volume 45,
position 3861944: I/O error
12/20/2010 11:10:56 sdf_slave: sdf_sl_hw_search: sdf_dev_open: The device is not
ready.
12/20/2010 11:10:56 primary_sdf_migrator: hw mark on mirror search failed at
volume 45: The device is not ready.
12/20/2010 11:10:56 primary_sdf_migrator: tape_prime: hw mark on mirror search
failed at volume 45: Hot sync failed at volume 45 position 3861944: The device
is not ready.
12/20/2010 11:10:56 sdf_do_hotsync: Current Hot Sync status: main(Primary) at:
45/6565102, Primary at: 45/6565102, Secondary at: 45/3861944, Secondary not in
sync
12/20/2010 11:10:56 mirroring: F:\QStarcache: Hot sync incomplete: volume 45,
position 3861944: The device is not ready.
12/20/2010 11:34:01 JB: preunload: JB0 Drive 2 marked bad:
C:\QStar\dev\JB0\rdrive2: scsi start/stop: The request could not be performed
because of an I/O device error., SCSI driver error
12/20/2010 11:50:54 JB: preunload: JB0 Drive 2 marked bad:
C:\QStar\dev\JB0\rdrive2: scsi start/stop: The device is not ready., SCSI driver
error
Solution
1. Escalate to QStar technical support so that they can connect and then verify and identify the
cause of the issue.
For information, see Opening a ticket with QStar technical support (refer to page 7).
Agfa provides instructions on how to fix this problem if third-party support is required either
from the operating system or hardware level.
Before starting an upgrade, stop all IMPAX tools. Ensure that no write or retrieve activities will be
performed on the mount point during the upgrade.
You can upgrade QStar HSM to resolve some software issues.
Before starting an upgrade, stop all IMPAX tools. Ensure that no write or retrieve activities will be
performed on the mount point during the upgrade.
Note:
If the mount point is also exported to a third-party server via NFS, unmount and unpublish
the share.
You can upgrade QStar HSM as a way to resolve some software issues.
Important!
For safety, QStar suggests saving device and integral volume set database information
by running vllsdev device_name and vllsset -va. Redirect the output to a file, jbshut.jb.
Before starting an upgrade, stop all IMPAX tools. Ensure that no write or retrieve activities will be
performed on the mount point during the upgrade.
The QStar 5.0 license is provided in file format. Before starting the upgrade, ensure that you have
internet access so that you can receive the response from the QStar license department.
Note:
The QStar 5.0 license file format is incompatible with the cache files for all 4.x version, so the
system administrator will be forced to mount all integral volume sets with a clean cache after
the upgrade.
Before starting an upgrade, stop all IMPAX tools. Ensure that no write or retrieve activities will be
performed on the mount point during the upgrade.
The QStar 5.0 license is provided in file format. Before starting the upgrade, ensure that you have
internet access so that you can receive the response from the QStar license department. The QStar
5.0 license file format is incompatible with the cache files for all 4.x versions, so the system
administrator will be forced to mount all integral volume sets with a clean cache after the upgrade.
Note:
If the mount point is also exported to a third-party server via NFS, unmount and unpublish
the share.
Before starting an upgrade, stop all IMPAX tools. Ensure that no write or retrieve activities will be
performed on the mount point during the upgrade.
The QStar HSM 5.1 license is provided in file format. Before starting the upgrade, ensure that you
have internet access so that you can receive the response from the QStar license department.
Note:
When upgrading to QStar HSM 5.1.x, install the CBFS file system component when
prompted.
Before starting an upgrade, stop all IMPAX tools. Ensure that no write or retrieve activities will be
performed on the mount point during the upgrade.
If the mount point is also exported to a third-party server via NFS, unmount and unpublish
the share.
The QStar HSM 5.1 license is provided in file format. Before starting the upgrade, ensure that you
have internet access so that you can receive the response from the QStar license department.
Important!
To ensure a successful upgrade, do not copy the files used for an upgrade into /opt/QStar.
Doing so will cause the upgrade to fail.
Important!
To protect data, QStar recommends saving device and integral volume sets database
information by running vllsdev device_name and vllsset -va. Redirect the output to a
file, jbshut.jb.
C G
cache GUI
Temporary storage area for data on a Graphical user interface. Visual computer
computer’s local or external hard drives. environment that represents programs, files,
and options with graphical images, such as
icons, menus, and windows.
CIFS
Common Internet File System. This protocol H
defines a standard for accessing remote files. HSM
Regardless of operating system, users can share
files without having to install software. Hierarchical Storage Management. An HSM
archive system provides long-term storage of
data and access to data. Studies archived with
CLUI HSM are stored to a file system. A mount
Command Line User Interface. A point and subdirectory to store studies to is
command-line tool to help in the service of specified. The HSM system handles data
IMPAX MVF. CLUI allows you to execute SQL storage.
statements.
J
D jukebox archive
daemon An archive with one or more drives where
A background process that performs a media is loaded, with multiple slots that hold
specified operation at a predefined time or in the media for easy storage retrieval, and with
response to a specified event. a robotic changer to move media around
within the jukebox.
device
Collection of settings and information used
L
to represent an external system (real world logical volume
device) or internal process (system device). Pooled logical extents can be concatenated
Devices are managed in Service Tools using together into virtual disk partitions called
the Device Manager. logical volumes or LVs. Systems can use LVs
as raw block devices just like disk partitions:
creating mountable file systems on them, or
using them as swap storage. In computer
storage, logical volume management provides
P
partition
An isolated section of memory on a mass
storage device.
S
SCSI
Small Computer System Interface. A
communication interface used between the
archive and the Archive Server.
A unavailable...............................................9
devices
aborted command........................................31 bus reset.................................................37
archiving no space available..................................14
backlog in queue...................................15 not configured or claimed....................11
not possible......................................33, 38 not ready................................................39
unavailable.....................10, 11, 14, 17, 27 unresponsive..........................................30
audience requirements...................................6 disconnection of network drive..................18
available media.............................................13 documentation
giving feedback........................................3
B warranty statement.................................2
drive configuration file................................34
blank check...................................................33 drives
blank media unresponsive..........................................39
unavailable.............................................13
bring online media.......................................10
bus device reset............................................37 E
email
C documentation feedback........................3
erased media
cabling problem...........................................27 unavailable.............................................13
cache size......................................................17
CIFS sharing.................................................23
claimed devices............................................11 F
configuration issues.......................................9 full file mode
copyright information...................................2 configuring file systems........................21
covered file system error........................10, 33
G
D
getting started.................................................6
daemons.......................................................11
damaged chip on media...............................28 H
database corruption.....................................17
database overwrite failure............................33 hardware error
data phase error............................................31 internal target failure.......................30, 38
data retrieval media ejection failure............................29
services stop...........................................18 media load failure..................................29
unavailable...........................11, 14, 15, 17 hot sync.........................................................37
data storage how to troubleshoot QStar HSM..................7
U
unavailable resources...................................36