Frozen Filesystems for Faster PyScript Startup #2417

ntoll · 2025-12-16T18:17:12Z

ntoll
Dec 16, 2025
Maintainer

TL;DR Could we use Emscripten's filesystem API to freeze and archive the complete Python runtime state (including installed packages) into a single downloadable asset? This would eliminate the need for pip-based package installation on every page load, significantly reducing startup time.

Currently, PyScript's startup process works like this (especially where Pyodide is concerned):

Download and initialise the Python interpreter.
Download required packages/dependencies (cached in local storage after first load).
Install packages into the Emscripten virtual filesystem using Pyodide's micropip.
Run the user's application.

While we cache downloaded files in local storage to speed up subsequent loads, there's still a significant cost: on every page load, packages must be copied and installed into the browser's virtual filesystem. This installation can only happen after the Python interpreter is running, since we rely on Pyodide's micropip for the installation process.

I'd like to explore a different (perhaps more efficient) approach ~ instead of installing packages on every load, we could:

Freeze ❄️: Capture the complete state of the Emscripten virtual filesystem after all packages and data files are installed.
Archive 📦: Compress this frozen state into a single downloadable asset (ZIP or similar format).
Restore 🚚: On application startup, unarchive the frozen filesystem directly into the Emscripten virtual filesystem before or during interpreter initialisation.

This mitigates the costs of pip/files related operations on page load. The archived filesystem is downloaded once (and cached), then simply unarchived on subsequent loads. That's it!

It turns out great minds think alike (or fools seldom differ 😛) because @dpgeorge (MicroPython maintainer and friend-of-PyScript) has created something similar called RomFS for MicroPython. In his approach he:

Uses a make command to "freeze" a directory of files into a byte array (the RomFS).
Downloads this asset in the browser.
MicroPython's romfs feature mounts the frozen filesystem at a specified mount point.

Damien described RomFS to me as similar to an ISO image for a CD rom - a frozen snapshot of the filesystem at a point in time. This shows the approach is viable and that others have found it necessary for performance.

Enter the Emscripten Filesystem API, as a possible approach that could work with both Pyodide AND MicroPython.

The approach I've initially taken is to use the public Emscripten FS API to gather/write file contents and directory structure. This approach will work no matter the underlying implementation detail of the filesystem but requires many calls to the Emscripten FS API. There is an alternative approach (which I initially considered, but abandoned because it depends on implementation details rather than the public API): the default implementation of the Emscripten FS is MEMFS (basically a byte array), although other implementations are possible, and we could directly access MEMFS's internal node structure for better performance.

But I think I prefer the simplicity of the approach I've taken: in the following code, the interpreter object is a reference to either MicroPython or Pyodide, both of which have a FS reference to the underlying Emscripten FS API. It's basically a recursive creation of an object whose keys are the names of directories or files, and whose values are other similar objects (for subdirs) or base64 encoded raw data for files - done for simplicity's sake:

/**
 * Recursively freeze a directory into an object. E.g.
 * 
 * my_directory["foo.txt"]  // will get the content of "foo.txt".
 *
 * Sub-directories are child objects of the same type.
 */
function freezeDirectory(interpreter, path) {
  const entries = interpreter.FS.readdir(path);
  const result = {};
  for (const entry of entries) {
    if (entry === '.' || entry === '..') continue;
    const fullPath = path === '/' ? '/' + entry : path + '/' + entry;
    try {
      const stat = interpreter.FS.stat(fullPath);
      if (interpreter.FS.isDir(stat.mode)) {
        // Directory: recursively freeze its contents.
        result[entry] = freezeDirectory(interpreter, fullPath);
      } else if (interpreter.FS.isFile(stat.mode)) {
        // File: store as base64 string with metadata prefix. I know ;-)
        const data = interpreter.FS.readFile(fullPath, { encoding: 'binary' });
        const base64 = btoa(String.fromCharCode(...data));
        result[entry] = `FILE:${stat.mode}:${stat.mtime}:${base64}`;
      } else if (interpreter.FS.isLink(stat.mode)) {
        // Symlink: store target path with metadata prefix. Needs testing.
        const target = interpreter.FS.readlink(fullPath);
        result[entry] = `LINK:${stat.mode}:${stat.mtime}:${target}`;
      }
    } catch (e) {
      // Log stuff that can't be accessed.
      console.warn(`Skipping ${fullPath}: ${e.message}`);
    }
  }
  return result;
}

/**
 * Freeze the entire filesystem.
 */
function freezeFilesystem(interpreter) {
  return freezeDirectory(interpreter, '/');
}

To restore the filesystem you could do something like this:

/**
 * Recursively restore a directory from frozen structure.
 */
function restoreDirectory(interpreter, path, frozen) {
  for (const [name, value] of Object.entries(frozen)) {
    const fullPath = path === '/' ? '/' + name : path + '/' + name;
    if (typeof value === 'object') {
      // It's a directory..!
      try {
        interpreter.FS.mkdir(fullPath);
      } catch (e) {
        // Directory already exists, just ignore.
      }
      restoreDirectory(interpreter, fullPath, value);
    } else if (value.startsWith('FILE:')) {
      // It's a file.
      const [_, mode, mtime, base64] = value.split(':');
      const binary = atob(base64);
      const data = new Uint8Array(binary.length);
      for (let i = 0; i < binary.length; i++) {
        data[i] = binary.charCodeAt(i);
      }
      interpreter.FS.writeFile(fullPath, data);
      interpreter.FS.chmod(fullPath, parseInt(mode));
    } else if (value.startsWith('LINK:')) {
      // It's a symlink. Needs checking.
      const [_, mode, mtime, target] = value.split(':');
      interpreter.FS.symlink(target, fullPath);
    }
  }
}

/**
 * Restore the entire filesystem.
 */
function restoreFilesystem(interpreter, frozen) {
  restoreDirectory(interpreter, '/', frozen);
}

Which is the same thing but in reverse... called like this: restoreDirectory(interpreter, '/', frozen); where frozen is the frozen "object" created by freezeFilesystem.

I've tried my simple PoC code on a page like this:

<!DOCTYPE html>
<html lang="en">
    <head>
        <meta charset="utf-8" />
        <meta name="viewport" content="width=device-width,initial-scale=1" />
        <title>PyScript 2025.11.2</title>
        <link rel="stylesheet" href="https://pyscript.net/releases/2025.11.2/core.css" />
        <script type="module">
          import { hooks } from 'https://pyscript.net/releases/2025.11.2/core.js';

          hooks.main.onReady.add((wrap, element) => {
            const { interpreter, type } = wrap;
            console.log(`${type} ready`);
            console.log(interpreter);
            const frozen = freezeFilesystem(interpreter);
            console.log(JSON.stringify(frozen));
            console.log(restoreFilesystem(interpreter,frozen))
          });

        /* COPY THE JAVASCRIPT FUNCTIONS FOR FREEZE/RESTORE HERE (SEE ABOVE) */
        </script>
    </head>
    <body>
      <script type="mpy">
        print("Hello MicroPython!")
      </script>
      <script type="py">
        print("Hello Pyodide!")
      </script>
    </body>
</html>

I suspect I've missed many edge cases, but at least it works.

This code is merely to illustrate an approach and should not be considered as something production ready. Of course, we'd need some way to extract and zip up the object created by freezeFilesystem, and this can be done if/when we choose to go ahead with this work. But the essentials are all there. 🚀

Thoughts, ideas, constructive critique, and feedback most welcome.

WebReflection · 2025-12-16T22:03:54Z

WebReflection
Dec 16, 2025
Maintainer

btoa(String.fromCharCode(...data))

this is a slippery slope for various reasons:

JS runtimes have an arguments length limit that is not standardized, a bigger file (as in, bigger VFS) would fail here or there
JSON is fully compatible with an array of 256 bytes (aka: Unint8Array) so that using a more fragile atob and btoa based approach, when proper binary is meant, feels both slower and underrated
if it's binary we are saving, let's keep it binary because both JS and Pyodide/MicroPython understand binaries pretty well, with or without JS atob and bota or base64 dependency / transformation involved

That being said, everything else is more than welcome but I need to double check technically this is the best we can offer although I do admire the effort and ideas you put into this to date and I'd be more than happy to finalize a concrete solution around this, of course with your review (or mine) needed to validate we're all good with the idea/proposal and results ... so, thanks a lot for digging this path, it all makes sense to me, the devil is in the details and we should try to provide the best/fastest details we can around it, I hope you would agree on that.

1 reply

WebReflection Dec 16, 2025
Maintainer

forgot to mention, hooks work like that on the main thread but it wouldn't on the worker case, because the interpreter cannot be forwarded as it it to the main thread ... we need to eventually orchestrate this dance somehow out of polyscript bootstrapper for interpreters, but then again, everything else feels and looks sound to me so that shouldn't be an issue, just a "room" to discuss how we want to do that which, if possible in all cases, would be great and I don't see any reason that shouldn't be possible, as long as we agree the base64 convertion in the process might be overkill, not needed!

ntoll · 2025-12-17T04:06:41Z

ntoll
Dec 17, 2025
Maintainer Author

Thanks for the response. Yeah, I know about the btoa problems and, as I mentioned in the call, I used this approach because it was simple and easy and not really important to the core concepts in this discussion (i.e. we can use something else as we refine a potential solution ~ base64 encoding of file content is sub-optimal and was only used for convenience; it also demonstrates my relative lack of JS knowledge in this area, so I'll defer to your suggestions in this regard).

The important point I wanted to make was it appears we could freeze the filesystem relatively easily via the Emscripten FS API.

But as you say, the devil is in the details. But that's where the fun lies. 😉

"Devils to be detailed" that I want to explore 😈:

Do we need to freeze the whole filesystem? Perhaps we just freeze directories on the user's "path"; or WASM-y equivalent thereof. Places like lib/python3.13/site-packages (where packages reside - presumably in the case of Pyodide, we need to check) or the user's home (where files end up) immediately spring to mind. I wonder where mip (in MicroPython) puts its assets? I vaguely remember this might be the user's home directory.
DX/tooling: in a similar way to your recent work around Positron dependency capture via a headless browser, perhaps we could take a similar approach so folks can pip install pysfreeze (for example) and just pysfreeze inside the directory of their PyScript project to produce a project_name.frz "image"..? (.frz for "freeze")
PyScript configuration: we already have entries for packages and files but perhaps we need a restore entry that points to the URL of the project_name.frz image? Also, this should be a logical "OR". Either you have packages and files OR just restore in your PyScript config, and we'll raise an error if you have both.
Definition of idiomatic workflow: clearly you need to specify the packages and files BEFORE you freeze the filesystem. But our tooling could, with the aid of a flag (so the user has a choice), read the referenced configuration file for PyScript and remove the previously needed packages and files entries before replacing them with a restore section pointing at the resulting ./project_name.frz..?

2 replies

WebReflection Dec 17, 2025
Maintainer

Either you have packages and files OR just restore in your PyScript config, and we'll raise an error if you have both.

not super convinced that's going to work ... are you thinking about creating those frozen FS in "localhost" then upload those somewhere and use those? because in this case you'll need to maintain two different configs ... and I'd rather have a restore that, if the file is there, uses it, otherwise it does everything regularly ... that would also allow you to bootstrap, do things, create such frozen FS via some API and download it so you can use it once it's placed in the right place.

perhaps we could take a similar approach so folks can pip install pysfreeze (for example) and just pysfreeze inside the directory of their PyScript project

sure but JS, CSS and other assets don't live inside the VFS, those are needed to be available for the browser engine or no style/js/image will be displayed ... maybe I am missing something here but browser assets are different from VFS entries, a single file that lands on VFS cannot physically land in a way the browser can reach/use those assets

ntoll Dec 17, 2025
Maintainer Author

PyScript config: yeah I was thinking of a localhost based use-case. A cloud based API for creating frozen filesystems is something I hadn't considered. I think we should figure out how to do both and see what folks find most useful. My concern expressed as the logical "OR" in the configuration file was around such settings interfering with each other (which takes precedence? what happens if there's a filename clash? etc). In my mind's eye I was saying, "look you can have X or Y (but not both)". Also, I don't think there need be two config files... see my comment about the freezing process updating the config file automatically for you. Ultimately, I think us having this conversation in the open will help the community see what decisions we need to make, and they'll hopefully weigh in with their preferences. We should listen to them (but that's just normal for us... just stating it for the record so the community know we're listening).

JS/CSS assets: of course... this discussion is ONLY about assets inside the VFS. The problem is, such assets often need to wait on the interpreter to be running before they're downloaded (as many files) and installed every time the page is loaded. As you know, freezing the VFS mitigates this process.

WebReflection · 2025-12-17T09:56:40Z

WebReflection
Dec 17, 2025
Maintainer

@ntoll I took a chance to revisit your logic so these are my variants:

no string concatenation
no string conversion (aka: no base64 involved)
no splits needed
what's passed is the FS not a wrapper that needs to have an FS entry ... this feels better because interpreter in there is never really useful, it's just the FS that is expected to be an Emscripten one, the fact it's reachable via interpreter.FS is irrelevant for this logic

function asNode({ mode, mtime }, data) {
  return { mode, mtime, data };
}

function freezeDirectory(FS, path) {
  const entries = FS.readdir(path);
  const result = {};
  for (const entry of entries) {
    if (entry === '.' || entry === '..') continue;
    const fullPath = path === '/' ? '/' + entry : path + '/' + entry;
    try {
      const stat = FS.stat(fullPath);
      if (FS.isDir(stat.mode)) {
        // Directory: recursively freeze its contents.
        result[entry] = asNode(stat, freezeDirectory(FS, fullPath));
      }
      else if (FS.isFile(stat.mode)) {
        const data = FS.readFile(fullPath, { encoding: 'binary' });
        // TODO: the [...array] conversion *might* not be necessary,
        //       although this way it works as JSON too.
        result[entry] = asNode(stat, [...data]);
      }
      else if (FS.isLink(stat.mode)) {
        result[entry] = asNode(stat, FS.readlink(fullPath));
      }
    }
    catch (e) {
      // Log stuff that can't be accessed.
      console.warn(`Skipping ${fullPath}: ${e.message}`);
    }
  }
  return result;
}

function freezeFileSystem(FS) {
  return freezeDirectory(FS, '/');
}

function restoreDirectory(FS, path, frozen) {
  for (const [name, node] of Object.entries(frozen)) {
    const fullPath = path === '/' ? '/' + name : path + '/' + name;
    if (FS.isDir(node.mode)) {
      try {
        FS.mkdir(fullPath);
      } catch (e) {
        // Directory already exists, just ignore.
      }
      restoreDirectory(FS, fullPath, node.data);
    }
    else if (FS.isFile(node.mode)) {
      FS.writeFile(fullPath, Uint8Array.from(node.data));
      FS.chmod(fullPath, node.mode);
    }
    else if (FS.isLink(node.mode)) {
      FS.symlink(node.data, fullPath);
    }
  }
}

function restoreFileSystem(FS, frozen) {
  return restoreDirectory(FS, '/', frozen);
}

now ... this version of mine:

in comparison, for some reason, your version fails at restoring so that all timings are kinda irrelevant but I still believe my version is easier to reason about, there's less magic involved, those buffers will compress well after gzip ... what do you think?

your screenshot, if interested:

2 replies

WebReflection Dec 17, 2025
Maintainer

P.S. it is very possible that the zip file is too big to be handled as String.fromCharCode(...) data, reason I believe we don't need to use base64 at all, we already have a uint8 view of the binary content and that's all we need, nothing else, really.

ntoll Dec 17, 2025
Maintainer Author

Bravo - you've taken my knackered X-wing from the swamp that is my JS code, and made it fly. 🤣

Your version is MUCH better than mine.

WebReflection · 2025-12-17T10:26:53Z

WebReflection
Dec 17, 2025
Maintainer

OK, keeping the discussion relevant, now that we have an easy/fast enough way to produce a fs.json file #2417 (comment) that works with any file size, what I'd like to test now is how long would it take to have matplotlib as package so that:

we use our current cache to compare bootstrap time
we freeze that into an fs.json
we change the config to not include matplotlib, yet ...
we use the very same code to import from it and test that everything is fine

My concerns I'd like to validate is that micropip might be deeply integrated in Pyodide, as example, so that maybe if modules don't get to be requested via it or registered somehow, things might break ... but I think we need to be sure that once we manage to freeze the FS, bootstrapping from it will actually work.

In an ideal world, we should be able to bootstrap the FS ourselves automatically, but let's do one step after the other ... I will update this with results, once I have some!

8 replies

WebReflection Dec 17, 2025
Maintainer

forgot to mention one final thought around this: if the file is already in the FS we should probably not try to rewrite it, so we can skip files that are inevitably there every single time Pyodide or MicroPython bootstrap ... I don't know how much faster that would be, it's not currently the bottleneck, but something to keep in mind (imho).

WebReflection Dec 17, 2025
Maintainer

problematic in what sense?

to start with, you cannot push more than 50MB of file on GitHub Pages, that's the very first issue: it's too big as FS!

secondly, it takes 700ms+ to just fetch that file locally ... true that if cached it should be not relevant, but at the same time, I am not sure common servers do provide hard Etag cache out of the box (maybe http.server does?)

Still I believe we should compress that file ... I'll do that and see how that goes, more results are coming!

WebReflection Dec 17, 2025
Maintainer

ouch ... it took 3.8 seconds to stringify and compress (although it produced a lovely 18.5MB .gz file) and the decompression story is very similar to the previous one ...

as summary:

if we compress the JSON format (we should) it might take long time ... this is a "one off" operation though so it might be OK
what we save in fetching the compressed fs is then slowed down by decompressing it ... the sum is slightly faster than just JSON but I was hoping to cut that time in half, not just drop 10% of perf
because big files are not super welcomed by CDNs and/or GitHub, I don't think avoiding compression should be considered as an option ... on the bright side, we have all it takes to orchestrate this dance via native browser APIs, on the other hand it's clear to me the way we're storing the tree as JSON requires extra computation that shouldn't be needed ...

My next step would be to flatten out the result of the crawling of the file system to have a unique buffer instead of JSON so that no parsing (stringify/parse) would be needed and we could stream the restoring operation ... I think this is worth exploring because it should defeat most painful points so far investigated.

ntoll Dec 17, 2025
Maintainer Author

This is great to see coming along!

Looking at the results and digesting what you say, I agree that flattening the results in a binary fashion is a good next step... especially (if I understand you correctly) we can start to work with the file while it's still downloading.

I also think many of the large files/directories on the browser's virtual filesystem will be Pyodide related things, Pyodide will put there in any case. Doing some refinement to only capture those aspects of the VFS that relate to the packages and files that the user has specified will likely save us a LOT of space.

WebReflection Dec 17, 2025
Maintainer

FYI I have a prototype of a Blob flattened binary thing that takes nothing to both create the FS and restore it ... the bad news is that I have some path issue I need to figure out but compared to 125MB of JSON we are down to 38.5MB of blob and the content is, theoretically the same ... we have (without compression) 100ms+ to fetch and 100ms+ to restore the whole thing and streaming is not even in there ... the TL;DR of this comment is that: "I'm working on it, it's looking really promising but I need it to work first, then we can decide/celebrate, or both!"

WebReflection · 2025-12-18T13:37:42Z

WebReflection
Dec 18, 2025
Maintainer

Update

I have everything finally working as desired:

the operation takes a few milliseconds to create a Blob (which carries an ArrayBuffer) that contains all details for files, directories, and links
the buffer is linear and using matplotlib as imported package, it produces 39MB of raw file VS 125MB that JSON for the very same FS graph
once compressed and saved as fs.gz smart CDNs or even GitHub pages will not re-compress it, and we'll be down to 14MB instead
with both compression and fetching it takes a few milliseconds to retrieve the frozen FS
to restore 39MB of data it takes consistently less than 100ms ... meaning Pyodide with matplotlib and all related dependencies can bootstrap in less than 2 seconds, fetching included 🥳
if the blob is stored into IndexedDB it takes nothing to read it (not compressed) and restore it

I've published a module so I could test online and on GitHub pages with ease, the module is called emscripten-fs-blob and you can test it live on GitHub pages (remember to open the devtools console).

That live demo bootstraps pyodide with matplotlib and everything else without needing a config to do so, and it "smokes" any other variant/cached alternative we offered to date but keep in mind we are using IndexedDB to cache the frozen micropip env, here we're using a fetch to retrieve the data.

If needed, I can improve the test page to compare gzip VS plain Blob so we can see how the browser cache could help too in there because decompressing is actually not super fast despite me using the latest/greatest Web APIs to do so.

The streaming bit has been left behind because pipeThrough is super slow with native APIs, I won't expect me implementing some convoluted logic around it being faster than just "fetch it all and parse", which is the current approach (edit: approach that is still compatible with streaming as bytes are ordered and it's linear 0 to buffer size operation).

Here a screenshot of various tiny benchmarks and related details (around operations):

I am confident you'll have similar results on your machine but we all know CPU matters specially for decompression related operations, yet I am extremely happy and glad this variant of @ntoll original idea really works well.

To be discussed

Part of the final size amount is due mtime which is nowhere used ... I think it was a good idea to have it in there, but I am not fully sure (if we don't use it) why that's part of the buffer ... I can see moreorchestration around the fact if such mtime (last modified time) is less than current file/folder/link, because that also exists, we could skip override and just use latest, resulting in hopefully less conflicting situations where Pyodide might be a different version and overwriting stuff that's newer might cause troubles because here we freeze the whole thing and the whole thing doesn't get versioned in the FS (i.e. site-packages for python3.13 ... that doesn't reflect the Pyodide version anywhere).

As summary

yes, we can freeze the whole FS in a fast and portable way
yes, frozen FS can be bundled within an app, for both Briefcase/Positron and PWA or any other offline/online scenario
the binary (Blob/Buffer) format wins for most operations except it won't be "human readable" like JSON is, although I believe that's a good thing
the project works on the front-end like in the backend, this opens doors to Pyodide frozen FS out of Cloudflare Workers and whatnot, but it also can work wonderfully on PyScript .com or GitHub pages

Happy to clarify or expand on any point, right now I hope this is a welcome feature pre x-mas time, and that people would play around and see how powerful is this possibility to entirely freeze a whole environment so that it will reproduce exact same results every single time it gets to be bootstrapped 👋

2 replies

ntoll Dec 18, 2025
Maintainer Author

Bravo Andrea. This is going to be fun to integrate with PyScript.

I wonder about selective and sensible choices for directories to archive. As I said before, only the packages and files aspects configured for PyScript really need to be a part of this. E.g. I suspect we're archiving all Python's standard library (not necessary!) and so on. 😉

But this is really great to see come together. That's another X-wing lifted out of the swamp... 🚀

WebReflection Dec 18, 2025
Maintainer

that's very true and it would save also quite a lot of data because just Pyodide with its zipped core packages assets is a bit heavy on the buffer but also inevitably already there ... so here the catch:

this approach coud allow us to entirely bypass Pyodide bootstrap but we need coordination with Pyodide folks themselves to understand how a Pyodide bootstrapped with a foreign FS could work, because ...
if we could tell Pyodide and MicroPython which FS should be used, as opposite of bootstrapping their own Emscripten FS, we could have variants that don't include Emscripten FS related code (which is not super tiny, yet awesome) and be able to let these runtimes have everything they expect to have already in place ... including their own pre-compiled files and whatnot
regardless, we could (and should?) offer a way to exclude certain files and/or folders, because it's easier to know core files than guess what are all the directories, files and/or symlinks that matplotlib or any of its dependent packages land ... that would be site-packages as example, which is YAGNI as Pyodide would inevitably (so far) provide that anyway, or other very runtime specific folders and files we know that wouldn't need to be replicated.

I am down and game with this idea, what matters now is that we are unlocking new bootstrap and share-ability potentials, and all the dots seem to be converging gracefully without issues!

dpgeorge · 2025-12-18T13:44:14Z

dpgeorge
Dec 18, 2025

Wow, this feature has really developed quickly!

I think what you've got here suits PyScript well. But let me just say a few words related to MicroPython and its existing ROMFS feature:

A ROMFS is created "offline", currently using mpremote romfs build <directory>. It just scoops up the entire directory and turns it into a frozen filesystem.
The resulting file/data then needs to be made available on the target you want to use it with. For microcontrollers, that means writing the data into flash somewhere. For webassembly, it means having the data as a Uint8Array accessible from Emscripten.
A key feature of ROMFS is that it's read-only-memory and actually must be available in addressible memory, that MicroPython can read using standard memory read operations. That leads to a lot of nice optimisations: eg (1) precompiled bytecode is executed inplace from the ROMFS without any need to copy it into RAM; and (2) large assets like fonts and images can stay in the ROMFS and be used directly without loading them into RAM either.
The benefits of (3) are also available to MicroPython webassembly: precompiled bytecode is instantly executable, and assets are essentially already loaded into memory, ready to use.

So I think ROMFS would still have its place in MicroPython webassembly, independent to PyScript's frozen filesystem feature. Or maybe we could somehow make a PyScript frozen filesystem be also memory mappable... then bytecode could be executed directly from it.

I have a branch with ROMFS enabled for the webassembly port here: https://github.com/dpgeorge/micropython/tree/webassembly-combined-patches . To use it it's very simple, you just need to load a ROMFS as a Uint8Array and then pass it into the interpreter constructor, eg:

const { loadMicroPython } = await import(`${base}/micropython.min.mjs`);
const romfs = new Uint8Array(await (await fetch("app.romfs", {responseType:"arraybuffer"})).arrayBuffer());
const mp = await loadMicroPython({ url: `${base}/micropython.wasm`, romfs: romfs });

2 replies

ntoll Dec 18, 2025
Maintainer Author

...and the Christmas gifts just keep on arriving. Bravo Damien. 🎅

Let's circle back after the Christmas festivities, take all the work and figure out how the story is told inside PyScript.

I'm excited by how helpful this will be for ease of deployment and speed of start-up.

Regarding "offline" creation - i.e. via tooling, Andrea has already done a similar trick with some Beeware related work he's been experimenting with. TL;DR - a headless browser is use via a CLI to "do the stuff" inside PyScript, and from which the useful results and assets can be extracted.

WebReflection Dec 18, 2025
Maintainer

thanks @dpgeorge and if I can ask a question:

Or maybe we could somehow make a PyScript frozen filesystem be also memory mappable

the current buffer implementation is a flat stream of data and each data could be mapped at specific byte-offset location (files, directories, symlinks) ... is that what you are after?

I haven't implemented a mapping of the FS in the current logic but having another step that produces "paths to point in memory with buffer boundaries" doesn't seem like something too hard to implement on top of what I have already ... although I have no idea if what I am describing makes sense, or if I actually even understood what's mapping means for you so thanks in advance for any possible further clarification around this topic ... after all, a ROM in the good'ol Gameboy cartridge" fashion is pretty much what we're after or what my current module has created 😅

Frozen Filesystems for Faster PyScript Startup #2417

Uh oh!

ntoll Dec 16, 2025 Maintainer

Replies: 6 comments · 17 replies

Uh oh!

Uh oh!

WebReflection Dec 16, 2025 Maintainer

Uh oh!

WebReflection Dec 16, 2025 Maintainer

Uh oh!

ntoll Dec 17, 2025 Maintainer Author

Uh oh!

WebReflection Dec 17, 2025 Maintainer

Uh oh!

ntoll Dec 17, 2025 Maintainer Author

Uh oh!

WebReflection Dec 17, 2025 Maintainer

Uh oh!

WebReflection Dec 17, 2025 Maintainer

Uh oh!

ntoll Dec 17, 2025 Maintainer Author

Uh oh!

WebReflection Dec 17, 2025 Maintainer

Uh oh!

WebReflection Dec 17, 2025 Maintainer

Uh oh!

WebReflection Dec 17, 2025 Maintainer

Uh oh!

WebReflection Dec 17, 2025 Maintainer

Uh oh!

ntoll Dec 17, 2025 Maintainer Author

Uh oh!

WebReflection Dec 17, 2025 Maintainer

Uh oh!

Uh oh!

WebReflection Dec 18, 2025 Maintainer

Update

To be discussed

As summary

Uh oh!

ntoll Dec 18, 2025 Maintainer Author

Uh oh!

WebReflection Dec 18, 2025 Maintainer

Uh oh!

dpgeorge Dec 18, 2025

Uh oh!

ntoll Dec 18, 2025 Maintainer Author

Uh oh!

WebReflection Dec 18, 2025 Maintainer

ntoll
Dec 16, 2025
Maintainer

Replies: 6 comments 17 replies

WebReflection
Dec 16, 2025
Maintainer

WebReflection Dec 16, 2025
Maintainer

ntoll
Dec 17, 2025
Maintainer Author

WebReflection Dec 17, 2025
Maintainer

ntoll Dec 17, 2025
Maintainer Author

WebReflection
Dec 17, 2025
Maintainer

WebReflection Dec 17, 2025
Maintainer

ntoll Dec 17, 2025
Maintainer Author

WebReflection
Dec 17, 2025
Maintainer

WebReflection Dec 17, 2025
Maintainer

WebReflection Dec 17, 2025
Maintainer

WebReflection Dec 17, 2025
Maintainer

ntoll Dec 17, 2025
Maintainer Author

WebReflection Dec 17, 2025
Maintainer

WebReflection
Dec 18, 2025
Maintainer

ntoll Dec 18, 2025
Maintainer Author

WebReflection Dec 18, 2025
Maintainer

dpgeorge
Dec 18, 2025

ntoll Dec 18, 2025
Maintainer Author

WebReflection Dec 18, 2025
Maintainer