Skip to content
Snippets Groups Projects
  1. May 11, 2021
  2. May 10, 2021
  3. May 09, 2021
  4. May 08, 2021
  5. May 07, 2021
  6. May 06, 2021
  7. May 05, 2021
  8. May 01, 2021
    • cheng cheng's avatar
      Insert double pipeline buffer between adjacent stage (#4798) · c28bacad
      cheng cheng authored
      
      * Pipeline Parallelism: checkpointing insert identity buffer op
      
      * fix complier err
      
      * identity buffer op custom out regst num
      
      * fix bug and runnable
      
      * Chain merge divide fw/bw; MemChain ignore merge; copyhd regst num hack
      
      * Pipeline buffer pass
      
      * Pipeline runnable
      
      * rollback NOT merge mem chain hack
      
      * pipeline_stage_id_hint and rollback checkpointing buffer
      
      * Pipeline buffer only. test pass.
      
      * rollback repeat hack
      
      * Remove CopyHd Hack; Add buffer cross label loader and loss
      
      * InsertNcclLogicalOps support multi-subgraph by placement group
      
      * Implement of InsertNcclLogicalOpPass support multi-subgraph and batch acc and pipeline parallel
      
      * Pipeline + 2D-SBP runnable
      
      * remote note
      
      * WARNING to INFO
      
      * refine code for review & fix for new dtype infer
      
      * add note
      
      * Hotfix ib qp max_send_wr
      
      * Pipeline Debug: hack copy hd/commnet regst num to 2 and increase buffer regst num +1
      
      * add buffer op for copyD2H pipe
      
      * fix bug of insert buffer in backward pass stages
      
      * rollback buffer hack
      
      * buffer size = total stage num
      
      * Hack buffer size of loader data/label
      
      * Double buffer
      
      * fix bug of check
      
      * hack buffer size
      
      * Fix buffer connected bug. and refine buffer size
      
      * collection reserve for saving rehash cost
      
      * refine code and note
      
      * set dst buffer size to max
      
      * refine dst buffer size
      
      * Refine Backward buffer size
      
      Co-authored-by: default avatarliujuncheng <liujuncheng1022@gmail.com>
      Co-authored-by: default avataroneflow-ci-bot <69100618+oneflow-ci-bot@users.noreply.github.com>
      c28bacad
    • Shenghang Tsai's avatar
      dont use import to determine if export onnx interface (#4801) · f0c6248c
      Shenghang Tsai authored
      
      * dont use import to determine if export onnx
      
      * fix
      
      Co-authored-by: default avataroneflow-ci-bot <69100618+oneflow-ci-bot@users.noreply.github.com>
      f0c6248c
    • cheng cheng's avatar
      Feat: InsertNcclLogicalOps support multi-subgraph by placement group (#4753) · 38dc8377
      cheng cheng authored
      
      * Pipeline Parallelism: checkpointing insert identity buffer op
      
      * fix complier err
      
      * identity buffer op custom out regst num
      
      * fix bug and runnable
      
      * Chain merge divide fw/bw; MemChain ignore merge; copyhd regst num hack
      
      * Pipeline buffer pass
      
      * Pipeline runnable
      
      * rollback NOT merge mem chain hack
      
      * pipeline_stage_id_hint and rollback checkpointing buffer
      
      * Pipeline buffer only. test pass.
      
      * rollback repeat hack
      
      * Remove CopyHd Hack; Add buffer cross label loader and loss
      
      * InsertNcclLogicalOps support multi-subgraph by placement group
      
      * Implement of InsertNcclLogicalOpPass support multi-subgraph and batch acc and pipeline parallel
      
      * Pipeline + 2D-SBP runnable
      
      * remote note
      
      * WARNING to INFO
      
      * refine code for review & fix for new dtype infer
      
      * add note
      
      * collection reserve for saving rehash cost
      
      Co-authored-by: default avataroneflow-ci-bot <69100618+oneflow-ci-bot@users.noreply.github.com>
      38dc8377