Chameleon Newsletter & Changelog - October 2025

Testbed updates, new features, webinars, and other exciting news from our user community

Welcome to the Chameleon Newsletter! With so many developments emerging from Chameleon and our surrounding community, our changelog has recently become quite full. Some of these updates are not quite "changelog material," so we are transitioning our monthly changelog into a monthly newsletter. Of course, the changelog will still remain front and center, serving as a valuable resource for important updates to Chameleon infrastructure and services. In addition, the newsletter will include other important announcements, calls for upcoming webinars and events relevant to the Chameleon community, featured community news and resources from and for our users, and a monthly survey question. Check it out below!

If you have interesting tools, research, or events that the Chameleon community would find valuable, please share them with us; we'd love to feature them here. Contact Marc Richardson, mtrichardson@uchicago.edu, to suggest newsletter items for inclusion each month.

 

Hello Chameleon Users!

October was Performance Month for Chameleon Cloud. We're excited to share a variety of performance upgrades for various testbed services, a new Trovi feature, new webinars, user resources, and awesome Trovi artifacts developed by our users!

Changelog

Changes to testbed capabilities and services this month.

Trovi Collections and Citations. Trovi is Chameleon's repository of shared artifacts, allowing you to browse through hundreds of experiments that run on Chameleon. This month, we are excited to announce artifact collections. Now, you can create an artifact and then link it to other related artifacts to form a collection. These links can be used to group together artifacts that are part of a series, such as part of a course, or artifacts that are a part of a conference. We are looking forward to seeing all of the collections that our users will share on Trovi over the next few months!

We've also added a panel in Trovi artifact detail page that shows how to cite the current artifact. Previously, this information was only in our FAQ, which was difficult to find. Now you can easily copy the citation or bibtex information for any artifact from the dashboard directly.

Improved Storage at CHI@TACC and KVM@TACC. We've installed new hardware for the CHI@TACC and KVM@TACC storage cluster, which is used by the object store, filesystem shares, and VM volumes. Previously, this cluster was running on a combination of HDDs and SSDs, and now it entirely operates on NVMe SSDs. Additionally, we've improved the network connection on this cluster, and overall expect improvements to latency and bandwidth. Since this is a shared environment, we don't have guarantees on the performance you'll see in e.g., your KVM instances, but these changes should mean overall smoother IO performance and lower latency with fewer spikes.

Improved Resource Discovery API. This month, we've made significant improvements to our Resource Discovery API. This API is used internally by the Hardware Browser but is also open for your use. We've significantly improved the API response time, meaning that the hardware browser loads much faster. If you were previously using the REST API, please note that the API response has changed slightly. Mainly, we've removed parts of the response that were nonfunctional, such as the cluster status endpoint.

Vendordata Improvements. Vendordata is one of the services that runs on the testbed. It's used to generate the openrc file and the "message of the day" text when you login to a node. This month, we've improved the API response time of the service. We also fixed an issue on KVM, where the generated openrc file would not work properly.

Upcoming Webinars

The Chameleon team hosts webinars on a variety of topics ranging from how to use the testbed to showcasing specific tools, research workflows, and educational projects that users built on Chameleon. Register for an upcoming webinar below! All our past webinars are available to view on the Chameleon Youtube channel.

If you've developed artifacts or tools that could make research or teaching on Chameleon easier for others and are interested in presenting a webinar on that work, we'd love to hear from you! Reach out to us at the Help Desk to discuss your ideas.

Title Date Description
Chameleon: A Configurable Edge to Cloud Environment for Computer Science Research and Education - CMD-IT November Webinar November 6, 2025, 3:00 PM CT Our classic Introduction to Chameleon webinar. Open to the public. Chameleon is a large-scale, reconfigurable testbed supporting research and education in systems, networking, AI/ML, edge computing, and more. This talk will introduce its capabilities and show how you can use Chameleon for your own projects. Register here.
Teaching Machine Learning Operations (MLOps) at Scale November 25, 2025, 11:00 AM CT Professor Fraida Fund (NYU) will share a complete, openly available curriculum she developed for teaching operational ML skills at scale. Drawing from her Spring 2025 course serving nearly 200 students, she will present her pedagogical approach and hands-on learning materials made available through Chameleon's Trovi platform. Topics include cloud computing, DevOps for ML, distributed training, model serving, monitoring, and data systems. Register here.

Keep an eye on our webinar calendar as we announce new webinars for upcoming months. We already have two additional webinars scheduled for future months on creating MPI appliances and organizing artifact evaluations on Chameleon for HPC and computer systems.

Other Upcoming Events

Chameleon will be in St. Louis, MO for Supercomputing 2025 (SC25) this month! The SC activities kick off with our very own Kate Keahey, who will present a paper she co-authored with Fraida Fund, The Cost of Teaching Operational ML, at the EduHPC-25 workshop (paper) on November 16 (10:45 AM). Following that, three students who worked on projects with the Chameleon team over the summer will present their research posters at the ACM Student Research Competition on November 18.

Community News & Resources

Featured news and resources from Chameleon users for this month.

This month, we are excited to feature a NEW Trovi artifact that Chameleon user Ken Raffenetti (ANL) and his student, Rohan Babbar (University of Delhi), built during the Summer of Reproducibility 2025. This artifact is the focus of an upcoming webinar on MPI appliances in December 2025 that Ken Raffanetti will lead. The webinar will walkthrough the details of using the MPI artifact to set up a ready-to-use HPC cluster in the cloud with required libraries for running and managing MPI-based applications. Try it out and then come to the webinar to share your questions and feedback!

Lastly, don't miss out on these useful reads and resources from this month:

Upcoming Maintenance

Chameleon Changelog for August 2025

New H100 hardware on KVM@TACC, and good news about KVM lease limits, quotas, and SU cost. Additionally, we have SU changes for baremetal, an improved the resource browser, and are announcing the deprecation of TACC login.

Chameleon Changelog for July 2025

This month, we have a significant change to KVM@TACC: as per our announcement last month, all KVM instance launches will now require an advance reservation step in the allocation workflow (with the “advance” potentially being once second from now, i.e., on demand). Also, we have cool new hardware on CHI@Edge, and improvements to the “message of the day” on Chameleon-supported base images


Add a comment

No comments