The team continues to refine our core infrastructure and boost performance across Gemma3 / zkML Interface key modules. Here’s a quick look at what’s been built and improved this week.
2/ Gemma3 Performance: Quantized Gemma3 model currently includes nearly 10 000 nodes; kernelized execution shows limited performance due to excessive node granularity.
3/ Gemma3 Refactor: Analyzed model structure and found most nodes are shape-related and redundant—potentially removable. In the ideal case, over 90 % of nodes can be eliminated.
4/ zkML Iface Latency Optimization: Refactored zkmlface codebase, cutting inference latency down to tens of milliseconds. The interface is not yet connected to the TEE environment.
5/ Next Steps: Deploy the optimized zkmlface on a GPU TEE-enabled machine once available. Compile the pruned Gemma3 graph into high-efficiency GPU kernels for integration testing. Stay tuned for more updates
3,031
19
本頁面內容由第三方提供。除非另有說明,OKX 不是所引用文章的作者,也不對此類材料主張任何版權。該內容僅供參考,並不代表 OKX 觀點,不作為任何形式的認可,也不應被視為投資建議或購買或出售數字資產的招攬。在使用生成式人工智能提供摘要或其他信息的情況下,此類人工智能生成的內容可能不準確或不一致。請閱讀鏈接文章,瞭解更多詳情和信息。OKX 不對第三方網站上的內容負責。包含穩定幣、NFTs 等在內的數字資產涉及較高程度的風險,其價值可能會產生較大波動。請根據自身財務狀況,仔細考慮交易或持有數字資產是否適合您。