PrevPrev Go to previous topic
NextNext Go to next topic
Last Post 26 Aug 2024 01:27 PM by  Patrick Ng
Maximize LLM throughput - reminiscence of Sign-bit data processing
 0 Replies
Sort:
You are not authorized to post a reply.
Author Messages
Patrick Ng
Basic Member
Basic Member
Posts:150


--
26 Aug 2024 01:27 PM
    Bits and Pieces history - recall IBM-360 / 370 mainframe used to process seismic data had 16-bit; not until IBM-390, we could finally handle full 32-bit floating-point data. For true amplitudes and amplitude-versus-offset analysis to infer rock properties, that is a must.


    LLM quantization v sign-bit* processing Comp - Can't help to make the following comparison when it comes to the number of bits to make LLM run faster and cheaper today, just as seismic data processing in 1973.
     
    A) On one hand, here is a post on LLM infrastructure optimization 08.23.2024 from Google.

    https://cloud.google.com/...r-llm-serving-on-gke

    Quantization uses fewer bits, "newer model checkpoints are already published in 16-bit precision" and even drop down to 4-bit. "Recommendation - 1: Use quantization to save memory and cost. If you use less than 8-bit precision, do so only after evaluating model accuracy."

    B) On the other hand, paper on using sign bits in GEOPHYSICS vol. 38, Issue 6, December 1973,

    https://library.seg.org/doi/10.1190/1.1440394

    "The statistical properties of sign‐bit semblance were such that this system could do a velocity analysis and an interpretation with no human intervention. In this mode of operation it yielded state‐of‐the‐art accuracy at greatly increased speed and with greatly reduced storage requirements."

    Q: so can we use sign-bit for LLM quantization in all matters of GenAI training?

    That will be one for the IMAGE audience to take on.

    *Copilot assist - sign bit is a specific use of a one-bit representation, but not all one-bit representations necessarily serve as sign bits.
    0
    You are not authorized to post a reply.


    Moderators

    Andrew Munoz Fuego Exploration
    Patrick Ng Shaleforce

    Headquarters Contacts

    Susan Nash Director, Innovation and Emerging Science and Technology AAPG