How to Train a GPT From Scratch

An experiment reproducing NanoGPT and lessons learned

In this user experiment blog, Akash Kundu details his experience using Chameleon to replicate a GPT from scratch on a corpus of Shakespeare texts. Using Chameleon Cloud, he highlights the ease and impact of training generative pre-trained transformers (GPTs), aiming to democratize access to this technology and highlight the importance of reproducible experiments.

Announcing the 2024 Chameleon User Survey - Send Us Your Feedback by Feb. 6th!

Planning the next iteration of hardware and services on Chameleon Cloud

We invite all users to submit feedback through the survey. The responses we receive will directly impact future development of Chameleon, including where to focus hardware procurement efforts and extension of testbed capabilities. The survey is a great opportunity to help guide the future of Chameleon! We will accept responses until Feb. 6, 2024.

Chameleon Changelog for December 2023

Happy New Year! We are back and happy to see you are back as well! In this changelog edition we highlight the most significant testbed changes and upgrades from the last year and hint at some surprises yet to come. Looking forward to working with you in 2024!

Tickets of the Year on Chameleon (2023)

Read our tips and tricks for some common issues that users faced in 2023

Get the inside scoop on Chameleon Cloud’s common user issues and smart fixes in our latest 'Tips and Tricks' blog, highlighting a year of learning and problem-solving.

Happy Holidays to the Chameleon Community!

Updates on Chameleon during the winter break

In this end-of-year blog, the Chameleon Team expresses gratitude for the community's support and achievements in 2023. Highlighting the innovative work showcased in user experiment blogs, the team invites more exploration. The blog also announces the holiday schedule for the Chameleon Help Desk and the exciting return of the "Chameleon for Christmas" double lease offer, allowing extended access to resources, including GPU nodes. The post concludes with warm holiday wishes and anticipation for continued collaboration in 2024.

IndySCC Site Open to All Chameleon Users Until January 5th - Explore Now!

Early holiday gift comes to Chameleon Cloud

Great news for the Chameleon Cloud Community! The IndySCC site, previously exclusive to IndySCC teams, is now accessible to all users until January 5th, thanks to the University of Colorado Boulder's support. With 164 nodes featuring powerful hardware, including 2x 12 core Haswell CPUs and 100G Intel Omni-Path, it's a fantastic chance to expand your research capabilities. Remember, new reservations won't be accepted after January 5th, so make the most of this opportunity while it lasts.

Design Considerations and Analysis of Multi-Level Erasure Coding in Large-Scale Data Centers

Revolutionizing Data Storage with Multi-Level Erasure Coding

Delve into the intricate world of Multi-Level Erasure Coding (MLEC) and its application in large-scale data centers. Meng Wang, a Ph.D. candidate at the University of Chicago, presents comprehensive design considerations and analysis of MLEC, highlighting its advantages over traditional single-level erasure coding. The blog is aimed at exploring the significant impact of MLEC on data redundancy and storage efficiency.

Chameleon Changelog for November 2023

We had a busy month at Super Computing earlier this month and with IndySCC. We also have been preparing for a virtual reproducibility hackathon next month. Changes this month include improvements to resource reservations, CHI-in-a-box, appliance fixes, and JupyterHub updates.

Join Us in Advancing Scientific Research through Reproducibility on Chameleon Cloud

Chameleon Cloud's initiative for advancing scientific research through reproducibility invites the community to contribute to a growing repository of reproducible experiments. With state-of-the-art resources, Chameleon Cloud's Trovi service enables researchers to create and share their experiments as reusable artifacts. The upcoming Reproducibility Hackathon, a virtual event on December 15, 2023, offers an opportunity to transform experiments into valuable community resources. Users are encouraged to submit their Chameleon-based experiments for reproducibility packaging to enrich the platform, fostering a collaborative and innovative research environment.

High School Summer Research Students at NYU Investigate Cloud and Edge Inference on Chameleon

Exploring Cloud and Edge Inference: High School Students' Journey Through Machine Learning Research with Chameleon at NYU

This blog post outlines the experience of high school students engaging in a summer research program at NYU, focusing on cloud and edge Machine Learning inference projects utilizing the Chameleon platform and associated Trovi artifacts. The authors detail their practical exploration into machine learning at the cloud and the edge, review results, and discuss the technical challenges encountered and the solutions developed.