HUF 2024
Aula
SCC
-
-
9:00 AM
→
9:45 AM
Bus Transfer 45m Leonardo Hotel
Leonardo Hotel
-
10:00 AM
→
10:30 AM
Registration 30m Aula
Aula
SCC
Hermann von Helmholtz Platz 1 76344 Eggenstein-Leopoldshafen -
10:30 AM
→
10:45 AM
Welcome 15m Aula
Aula
SCC
Hermann von Helmholtz Platz 1 76344 Eggenstein-Leopoldshafen
Welcome
Speaker: Achim Streit (KIT-SCC) -
10:45 AM
→
11:00 AM
Introduction to HUF 2024 15m Aula
Aula
SCC
Hermann von Helmholtz Platz 1 76344 Eggenstein-Leopoldshafen
Speaker: Doris Ressmann (Karlsruhe Institute of Technology) -
11:00 AM
→
12:00 PM
Support Update 1h Aula
Aula
SCC
Hermann von Helmholtz Platz 1 76344 Eggenstein-Leopoldshafen
Speaker: Jonathan Procknow -
12:00 PM
→
1:30 PM
Lunch break 1h 30m 126
126
SCC
-
1:30 PM
→
2:00 PM
KIT's Site report 30m Aula
Aula
SCC
Hermann von Helmholtz Platz 1 76344 Eggenstein-Leopoldshafen
Speaker: Dorin Lobontu -
2:00 PM
→
2:20 PM
IN2P3 Site Status 20m Aula
Aula
SCC
Hermann von Helmholtz Platz 1 76344 Eggenstein-Leopoldshafen
Updates on HPSS at IN2P3 Computing Center
- Infrastructure and HPSS upgrade
- ARM architecture support
Speaker: Pierre-Emmanuel BRINETTE (IN2P3 / CNRS) -
2:20 PM
→
2:50 PM
Coffee Break 30m 126
126
SCC
-
2:50 PM
→
3:10 PM
Transitioning HPSS Monitoring from Nagios to VictoriaMetrics 20m Aula
Aula
SCC
Hermann von Helmholtz Platz 1 76344 Eggenstein-Leopoldshafen
Last year at NERSC we retired our long-standing Nagios-based HPSS monitoring deployment in favor of VictoriaMetrics, Loki and Alertmanager. We would like to share our experience and the lessons learned along the way.
- Motivation for making this transition
- Limitations of Nagios-style monitoring
- How does VictoriaMetrics address these?
- General overview of our monitoring deployment
- 3rd party exporters
- Custom exporters/"plugins"
- Demonstration of some of the dashboards we use and alerts we generate.
- Future areas of improvement
- Standardizing our HPSS-specific data collection
- Service discovery
Speaker: Mr Basil Lalli (NERSC - LBNL)
-
3:10 PM
→
3:40 PM
HPSS monitoring at KIT 30m Aula
Aula
SCC
Hermann von Helmholtz Platz 1 76344 Eggenstein-Leopoldshafen
Speaker: Preslav Konstantinov (KIT) -
3:40 PM
→
4:00 PM
Introduction to GridKa Tour 20m Aula
Aula
SCC
Speaker: Andreas Petzold (KIT) -
4:00 PM
→
4:30 PM
GridKa Tour 1 30m Aula
Aula
SCC
Hermann von Helmholtz Platz 1 76344 Eggenstein-Leopoldshafen -
4:30 PM
→
5:00 PM
GridKa Tour 2 30m Aula
Aula
SCC
Hermann von Helmholtz Platz 1 76344 Eggenstein-Leopoldshafen -
5:00 PM
→
8:00 PM
Flammkuchen Event (Tarte Flambée) 3h Aula
Aula
SCC
Hermann von Helmholtz Platz 1 76344 Eggenstein-Leopoldshafen -
8:00 PM
→
8:45 PM
Bus Transfer 45m Aula
Aula
SCC
Hermann von Helmholtz Platz 1 76344 Eggenstein-Leopoldshafen
-
-
9:00 AM
→
9:45 AM
Bus Transfer 45m Leonardo Hotel
Leonardo Hotel
-
10:00 AM
→
10:45 AM
HPSS Release Roadmap (Restricted) 45m Aula
Aula
SCC
Hermann von Helmholtz Platz 1 76344 Eggenstein-Leopoldshafen
Speaker: Michael Meseke -
10:45 AM
→
11:00 AM
Implementing a Virtualized HPSS Deployment for Testing and Development 15m Aula
Aula
SCC
Hermann von Helmholtz Platz 1 76344 Eggenstein-Leopoldshafen
As part of our efforts to upgrade our site to HPSS 10.3, Indiana University recently began development of a virtualized HPSS environment that we can use to quickly iterate on testing and development initiatives without tying up limited bare-metal hardware resources. This virtual-machine environment is patterned on the implementation created by IBM's HPSS Support team for use at the recent HPSS 10.3 Training held in May 2024.
Topics to be discussed include provisioning the VM, installing and configuring a virtual tape library using the mhVTL open-source software package, installing and configuring both DB2 and HPSS, and possibly a sampling of the sorts of issues we intend to test using this environment.
If technical affordances permit, this presentation could potentially include a live demonstration of the VM running on an external flash drive. Otherwise, we would be happy to present using the traditional static PowerPoint.
Speaker: Dr Forrest Greenwood (HPSS Subscriber) -
11:00 AM
→
11:30 AM
Coffee Break 30m 126
126
SCC
-
11:30 AM
→
11:45 AM
HPSS S3 Scalability With Rubin LFA S3 Store Use Case 15m Aula
Aula
SCC
Hermann von Helmholtz Platz 1 76344 Eggenstein-Leopoldshafen
SLAC National Accelerator Laboratory
Technology and Innovation Department
Scientific Computing Systems
With growing demand for HPSS S3 support from SLAC's science user community, we have been testing the HPSS S3 interface since the pre-GA release in July 2023. From the initial fragile and immature version to today's more robust and resilient state, we worked directly with the HPSS S3 development team to troubleshoot and triage many challenging issues with scalability, data I/O performance, and the handling of large, deeply nested data structures of very small files from the Rubin LFA Ceph S3 store use case. In this presentation we will tell the story of our journey in bringing HPSS S3's capability to the next level.
Speaker: Ms Guangwei Che -
11:45 AM
→
12:00 PM
Testing HPSS S3 Interface at MPCDF 15m Aula
Aula
SCC
Hermann von Helmholtz Platz 1 76344 Eggenstein-Leopoldshafen
Starting with version 10.3, HPSS has an S3 interface. We at MPCDF have installed it on our test system to try it out in several usage scenarios, including cloud sync (using the Ceph Cloud Sync module as well as rclone), generating presigned URLs, and simply using different S3 clients. Among our test actions, we put, get, and remove S3 objects, and we also retrieve S3 objects from tape. This talk covers the test scenarios, their setup and results, and the issues we encountered and their fixes.
Speaker: Elena Summer (Max Planck Computing and Data Facility (MPCDF)) -
12:00 PM
→
1:05 PM
Lunch break 1h 5m 126
126
SCC
-
1:05 PM
→
1:45 PM
BOF Monitoring 40m Aula
Aula
SCC
Hermann von Helmholtz Platz 1 76344 Eggenstein-Leopoldshafen -
1:45 PM
→
2:15 PM
NERSC Site Report 30m Aula
Aula
SCC
Hermann von Helmholtz Platz 1 76344 Eggenstein-Leopoldshafen
NERSC Site Report HUF24
Karlsruhe Institute of Technology Campus Nord, 09-12 September 2024
Abstract
Topics to discuss
- NERSC stats - PBs, etc.
- Upgrade HPSS 7.4.3 to 9.3
  - New RHEL8 core servers.
  - New FS7300 metadata arrays.
  - Update existing movers to RHEL8.
  - Updated PAM auth module to work with NERSC Auth.
- Install 4th TS4500 tape library
  - 16 Frame 1188+ slots.
  - Testing 10.0.1 firmware with SSL for REST over Ethernet.
  - TS1160 drives with JE media, while we figure out TS1170/JF.
  - Total theoretical capacity 950 PB on JE, 2.37 EB on JF media.
- Issues deploying TS1170 in our air-cooled environment
- Monitoring update (brief, specific talk to follow)
- REST over Ethernet testing
Speaker: Mr Owen James (NERSC/LBNL) -
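As a rough sanity check on the two capacity totals quoted in the NERSC abstract above, the sketch below converts each total into an implied cartridge count. The per-cartridge native capacities (20 TB for JE media in TS1160 drives, 50 TB for JF media in TS1170 drives) are assumptions based on commonly published drive specifications, not figures from the abstract, and the function name is purely illustrative.

```python
# Assumed native (uncompressed) capacity per cartridge, in TB.
# These values are NOT stated in the abstract.
JE_TB_PER_CARTRIDGE = 20   # assumption: JE media written by TS1160 drives
JF_TB_PER_CARTRIDGE = 50   # assumption: JF media written by TS1170 drives


def implied_cartridges(total_pb: float, tb_per_cartridge: float) -> float:
    """Return the cartridge count implied by a total capacity in PB (1 PB = 1000 TB)."""
    return total_pb * 1000 / tb_per_cartridge


je_count = implied_cartridges(950, JE_TB_PER_CARTRIDGE)    # 950 PB on JE
jf_count = implied_cartridges(2370, JF_TB_PER_CARTRIDGE)   # 2.37 EB on JF
print(f"JE: {je_count:,.0f} cartridges, JF: {jf_count:,.0f} cartridges")
```

Under these assumptions both totals imply roughly the same number of cartridges, which would be consistent with the same library capacity filled with higher-density media.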
2:15 PM
→
2:35 PM
Spectra Logic 20m Aula
Aula
SCC
Hermann von Helmholtz Platz 1 76344 Eggenstein-Leopoldshafen
Speaker: Matt Starr -
2:35 PM
→
3:05 PM
Coffee Break 30m 126
126
SCC
-
3:05 PM
→
4:05 PM
HPSS Object Storage Class Deep Dive 1h Aula
Aula
SCC
Hermann von Helmholtz Platz 1 76344 Eggenstein-Leopoldshafen
2024 HUF Presentations by IBM
Speaker: Greg Thorsness -
4:05 PM
→
4:35 PM
Have I right-sized my disk cache? 30m Aula
Aula
SCC
Hermann von Helmholtz Platz 1 76344 Eggenstein-Leopoldshafen
Abstract: For most sites, the HPSS disk cache is a critical component of the HPSS configuration, helping to boost the performance of storing data in and retrieving data from the archive. However, assessing how big the disk cache should be can be something of a black art, especially in environments that have grown over the years. This talk will present a couple of tools developed at NERSC that allow us to assess the effectiveness of an existing cache and give some insight into the impact of increasing or decreasing the disk cache size.
Speaker: Francis Dequenne (LBL) -
4:40 PM
→
5:25 PM
Bus Transfer 45m Aula
Aula
SCC
Hermann von Helmholtz Platz 1 76344 Eggenstein-Leopoldshafen -
8:15 PM
→
11:15 PM
Schlosslichtspiele 3h Karlsruhe Schloss
Karlsruhe Schloss
-
-
9:00 AM
→
9:45 AM
Bus Transfer 45m Leonardo Hotel
Leonardo Hotel
-
10:00 AM
→
11:00 AM
Upcoming HPSS Features (Restricted) 1h Aula
Aula
SCC
Hermann von Helmholtz Platz 1 76344 Eggenstein-Leopoldshafen
2024 HUF Presentations by IBM
Speaker: Michael Meseke -
11:00 AM
→
11:30 AM
Burning Issues (Restricted) 30m Aula
Aula
SCC
Hermann von Helmholtz Platz 1 76344 Eggenstein-Leopoldshafen
2024 HUF Presentations by IBM
Speaker: Jonathan Procknow -
11:30 AM
→
11:50 AM
Staging ~2 Million Files from Tape for a User 20m Aula
Aula
SCC
Hermann von Helmholtz Platz 1 76344 Eggenstein-Leopoldshafen
Imagine you turn on your work laptop or arrive at the office and find this message from the customer service team: "we have a user that is trying to retrieve many files from the HPSS archive. At the current retrieval rate, we estimate it will take 6 months for the user to retrieve all the files in the dataset. Can you help?"
What do you do? How do you proceed? What features does HPSS offer to help with this situation?
I'll answer those questions and more as we examine LLNL's approach to retrieving nearly 2 million files across tens of tape volumes with tools like quaid, SQLite, and RabbitMQ (along with a bit of custom Python code).
Speaker: Mr Geoff Cleary (LLNL) -
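One general way to speed up a large recall from tape is to group the requested files by tape volume and read each volume in position order, so that every tape is mounted once and read front to back. The sketch below illustrates only that idea and is not LLNL's actual workflow: the table layout, column names, and sample rows are hypothetical, and it uses only Python's standard sqlite3 module (no quaid or RabbitMQ calls are shown).

```python
import sqlite3
from itertools import groupby

# Hypothetical catalog rows: (file path, tape volume label, position on tape).
rows = [
    ("/proj/a/f1", "VOL002", 40),
    ("/proj/a/f2", "VOL001", 15),
    ("/proj/b/f3", "VOL002", 5),
    ("/proj/b/f4", "VOL001", 3),
]

con = sqlite3.connect(":memory:")
con.execute("CREATE TABLE files (path TEXT, volume TEXT, position INTEGER)")
con.executemany("INSERT INTO files VALUES (?, ?, ?)", rows)

# Sort by volume, then by position, so each tape is read front to back.
cursor = con.execute("SELECT volume, path FROM files ORDER BY volume, position")

# Build one batch per volume; each batch could be handed to a separate stage worker.
batches = {
    volume: [path for _, path in group]
    for volume, group in groupby(cursor, key=lambda row: row[0])
}

for volume, paths in batches.items():
    print(volume, paths)
```

In a real deployment the volume and position values would come from the archive's own metadata, and the batches would typically be queued for workers rather than printed.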
11:50 AM
→
12:10 PM
BOF Client Interfaces 20m Aula
Aula
SCC
Hermann von Helmholtz Platz 1 76344 Eggenstein-Leopoldshafen -
12:10 PM
→
1:30 PM
Lunch break 1h 20m 126
126
SCC
-
1:30 PM
→
1:40 PM
Group Photo 10m Aula
Aula
SCC
Hermann von Helmholtz Platz 1 76344 Eggenstein-Leopoldshafen -
1:45 PM
→
2:05 PM
Exploring storage technologies for HPSS disk caches 20m Aula
Aula
SCC
Hermann von Helmholtz Platz 1 76344 Eggenstein-Leopoldshafen
At KIT we operate HPSS as a tape system for the GridKa WLCG Tier-1 and for the Baden-Württemberg Data Archive service. Performance limitations of the HPSS disk cache systems led us to explore new technology options for the disk cache, based on classic storage systems with SSDs, and storage servers with local NVMe devices. We will present details on the different possible solutions, including benchmarks.
Speaker: Andreas Petzold -
2:05 PM
→
2:50 PM
Managing Data Throughout Its Lifecycle: Lessons Learned and Future Directions 45m Aula
Aula
SCC
Abstract: Data lifecycle management poses significant challenges, particularly in academic and research environments where data accumulation is rapid and perpetual. This presentation delves into the complexities surrounding data retention and abandonment, highlighting the prevalent issues of data hoarding and the lack of structured deletion policies. Specifically, it addresses the dilemma wherein users, especially researchers, find little incentive to delete data, leading to a cluttered and often inaccessible data landscape. Furthermore, the departure of users from institutions like Indiana University (IU) exacerbates the problem, as data may be left behind with no clear ownership or accessibility.
Indiana University is tackling these issues gradually. We'll discuss our efforts to address data management and abandonment through:
- New usage constraints: Instituting new quotas with tiered growth guidelines.
- Simplified archiving and movement: Providing user-friendly tools to archive and migrate data to appropriate storage tiers.
- Data management education: Empowering researchers with best practices for data stewardship.
- Ensuring allocation value: Requiring annual renewal of desired resources.
- The "Digital Will" concept: Developing a system where departing users can designate data inheritors and define deletion policies.
By examining the successes and pitfalls of these initiatives, this presentation provides valuable insights into effective data lifecycle management strategies. It underscores the importance of fostering a culture of responsible data stewardship while leveraging technological innovations to facilitate seamless data management throughout its lifecycle.
Speaker: Mr Charles McClary (HPSS Subscriber) -
2:50 PM
→
3:05 PM
Coffee Break 15m 126
126
SCC
-
3:05 PM
→
3:50 PM
HPSS Core Servers on Commodity Hardware or: How We Learned to Love Databases on ZFS 45m Aula
Aula
SCC
Hermann von Helmholtz Platz 1 76344 Eggenstein-Leopoldshafen
At LLNL we have been using commodity hardware more and more to serve our parallel filesystems and archival storage clusters. We wanted to explore how to use this same hardware for our HPSS Core Server systems. In order to make the system as reliable as possible, ZFS emerged as the underlying filesystem we wanted to utilize for its reliability and other advanced features. How would traditional databases perform on top of ZFS? Could we design a production-worthy system using this hardware?
Speaker: Herb Wartens -
4:00 PM
→
4:45 PM
Bus Transfer 45m Aula
Aula
SCC
Hermann von Helmholtz Platz 1 76344 Eggenstein-Leopoldshafen -
5:00 PM
→
6:00 PM
ZKM Tour 1h Lorenzstraße 19 Karlsruhe 76135 (ZKM)
Lorenzstraße 19 Karlsruhe 76135
ZKM
-
6:00 PM
→
9:00 PM
Conference Dinner 3h Lorenzstraße 19 Karlsruhe 76135 (ZKM)
Lorenzstraße 19 Karlsruhe 76135
ZKM
-
-
9:00 AM
→
9:45 AM
Bus Transfer 45m Leonardo Hotel
Leonardo Hotel
-
10:00 AM
→
10:40 AM
Generative AI and HPSS 40m Aula
Aula
SCC
Hermann von Helmholtz Platz 1 76344 Eggenstein-Leopoldshafen
2024 HUF Presentations by IBM
Speaker: Greg Thorsness -
10:40 AM
→
11:10 AM
MPCDF Site Report 30m Aula
Aula
SCC
Hermann von Helmholtz Platz 1 76344 Eggenstein-Leopoldshafen
We will present our activities with HPSS since the last HUF, including our upgrade to HPSS 10.3.
Speaker: Manuel Panea (Max Planck Computing and Data Facility) -
11:10 AM
→
11:40 AM
Coffee Break 30m 126
126
SCC
-
11:40 AM
→
12:05 PM
RESTful SSM 25m Aula
Aula
SCC
Hermann von Helmholtz Platz 1 76344 Eggenstein-Leopoldshafen
2024 HUF Presentations by IBM
Speaker: Fabi Adams -
12:05 PM
→
12:25 PM
SSC Site Report 20m Aula
Aula
SCC
Hermann von Helmholtz Platz 1 76344 Eggenstein-Leopoldshafen
The SSC Site Report will recap previous HUF presentations and focus on:
- Solution overview
- Review of components throughout upgrades
- HPNLS (High Performance Nearline Storage) architecture
- HPSS and RHEL Upgrade
- HPSS monitoring
- User tools and environment
Speaker: Tarak Patel -
12:25 PM
→
1:55 PM
Lunch break 1h 30m 126
126
SCC
-
2:00 PM
→
2:20 PM
JAXA Site Report 20m Aula
Aula
SCC
Hermann von Helmholtz Platz 1 76344 Eggenstein-Leopoldshafen
The recent operation status of JAXA HPSS "J-SPACE", the plans and issues for its replacement in 2025, and monitoring functionality will be reported.
Speaker: Naoyuki FUJITA (Japan Aerospace Exploration Agency (JAXA)) -
2:20 PM
→
2:30 PM
Closing HUF 2024 10m Aula
Aula
SCC
Hermann von Helmholtz Platz 1 76344 Eggenstein-Leopoldshafen
Speaker: Doris Ressmann (Karlsruhe Institute of Technology) -
3:00 PM
→
3:45 PM
Bus Transfer 45m Aula
Aula
SCC
Hermann von Helmholtz Platz 1 76344 Eggenstein-Leopoldshafen