1. Halo Guest, pastikan Anda selalu menaati peraturan forum sebelum mengirimkan post atau thread baru.

Cara gampang scrap gugel

Discussion in 'Pemrograman Web' started by mp3online, Dec 10, 2013.

  1. mp3online

    mp3online Super Hero

    Joined:
    Jul 19, 2011
    Messages:
    2,228
    Likes Received:
    294
    Location:
    jakarta
    mudah2an aja gak repost.
    kalau repost dihapus aja ya mod.
    langsung aja kodenya kayak gini:
    PHP:
    $html file_get_contents('http://www.google.com/custom?hl=en&q=usher+mp3');
    $dom = new DOMDocument;
    $dom->loadHTML($html);
    foreach(
    $dom->getElementsByTagName('h2') as $node) {
        
    $array[] = $dom->saveHTML($node);
    }
    kalau mau lihat hasilnya tinggal tambahin print_r($array);

    tinggal ngolah deh arraynya mau diapain
     
  2. fwasono

    fwasono Newbie

    Joined:
    Oct 22, 2013
    Messages:
    4
    Likes Received:
    1
    hmm.. keluarannya masih dalam bentuk text ya mas gan ?, gak di kasih filetype:mp3 aja sekalian klo memang domainnya mp3
     
    nekaters likes this.
  3. kited

    kited Ads.id Fan

    Joined:
    Dec 3, 2011
    Messages:
    178
    Likes Received:
    2
    Location:
    jakarta
    Tu kode taro dmna?


    Sent from my GT-S7270 using Tapatalk
     
    nekaters likes this.
  4. Daeng Aco

    Daeng Aco Super Hero

    Joined:
    Jul 1, 2012
    Messages:
    1,146
    Likes Received:
    329
    masih bingung :D
     
  5. amild

    amild Ads.id Pro

    Joined:
    Nov 29, 2012
    Messages:
    347
    Likes Received:
    71
    Location:
    Makassar
    Mantap nih. Hasilnya menampilkan judul dan url berdasarkan keyword yang dicari.
    hemmmm...kayaknya bisa dijadiin Auto-autoan hehehee :))
     
  6. p3tir

    p3tir Super Hero

    Joined:
    Jan 3, 2013
    Messages:
    1,119
    Likes Received:
    84
    idem,
    naronya di mana kodenya?
     
  7. ithoib

    ithoib Super Hero

    Joined:
    Jan 10, 2011
    Messages:
    789
    Likes Received:
    295
    Location:
    Kota Angin
    taroh dimana aja kayaknya gan...

    btw thanks bgt triknya, langsung dipraktekkan
     
  8. Al theradz

    Al theradz Hero

    Joined:
    Oct 25, 2013
    Messages:
    710
    Likes Received:
    20
    Location:
    Wonosobo - Yogyakarta
    Coba mantau dulu, msh bingung :D
     
  9. mp3online

    mp3online Super Hero

    Joined:
    Jul 19, 2011
    Messages:
    2,228
    Likes Received:
    294
    Location:
    jakarta
    keluarannya itu udah array boss.
    itu kan cuma contoh url gugel beserta query searchnya, silakan kembangkan sendiri sesuai target yg diinginkan.

    iya keywordnya juga udah dibold.

    bener banget, taroh di mana aja.
    itu kan script standalone, bisa masuk kmana aja, mau dibikin jadi plugins wp jg bisa :)
     
  10. prince

    prince Ads.id Fan

    Joined:
    Sep 1, 2013
    Messages:
    235
    Likes Received:
    11
    ane lumayan sering manual search query pake browser kena notifikasi "unusual traffict from your network" trus sama simbah disuguhi captcha ...
    Ini yang pake script kalo dijalanin terus apa emang bisa awet om ?



    Sent using Tapatalk 2
     
  11. dhevganx

    dhevganx Ads.id Fan

    Joined:
    Dec 8, 2009
    Messages:
    235
    Likes Received:
    53
    Location:
    A43T
    jelas ngga.. hehehe.. ide tambahannya scrap ke SE yg lain.. ato mo yg ribet lg.. dirandom scrap ke SE yg mana.. klo arraynya kosong dirandom lagi pilih SE lg.. jadi ip address kita lebih kecil kemungkinannya di blok alias kluar capcay-nya

    cmiiw
     
  12. go.dre.am

    go.dre.am Ads.id Pro

    Joined:
    Jun 4, 2011
    Messages:
    376
    Likes Received:
    61
    Location:
    www.tetuku.com
    setahu saya kalau tes di browser gpp. tapi kalau di curl langsung kena capcay. om gugel mencocokkan cookie dari client side (javascript) dan dari server side(browser) . ada yang bisa mengatasi?
     
  13. mp3online

    mp3online Super Hero

    Joined:
    Jul 19, 2011
    Messages:
    2,228
    Likes Received:
    294
    Location:
    jakarta
    kalau aku sih untuk urusan scrap bisa dibilang gak pernah pake file_get_contents, pake curl jg jarang.
    aku lebih suka pake snoopy.
    itu aku ngasih contoh pake file_get_contents biar simpel aja :)

    alhamdulillah sih pake snoopy punyaku awet blm pernah kena kecap :)
     
    dhevganx likes this.
  14. suriemie

    suriemie Ads.id Pro

    Joined:
    Sep 15, 2006
    Messages:
    404
    Likes Received:
    53
    kalo pake snoopy biar ga kena kecapnya si mbah harus pura2 jd browser ga (pake user agent)?
     
    Last edited: Dec 11, 2013
  15. mp3online

    mp3online Super Hero

    Joined:
    Jul 19, 2011
    Messages:
    2,228
    Likes Received:
    294
    Location:
    jakarta
    iya, lengkap ama cuki dll
     
    suriemie likes this.
  16. andiklive

    andiklive Super Hero

    Joined:
    Feb 4, 2012
    Messages:
    1,570
    Likes Received:
    103
    Location:
    Kurniawan Technologies
    wuih mantep nih gan tipsnya,

    btw google kalau di DOM bakal di ban gak yah ip host kita.

    ane pernah DOM blognya orang, ip ane malah block...
     
  17. mp3online

    mp3online Super Hero

    Joined:
    Jul 19, 2011
    Messages:
    2,228
    Likes Received:
    294
    Location:
    jakarta
    gak tau kalo trafiknya rame banget boss, kalo pageview sekitar 1k per hari sih masih aman.
    tapi kalo diban ip jg msh bisa diakalin pake proxy :)
     
  18. solik

    solik Super Hero

    Joined:
    Nov 12, 2007
    Messages:
    899
    Likes Received:
    28
    nggk semua server ngijinin pake file_get_contents, alternatif pake curl
    klo di cakephp pake http socket.

    thanks info nya
     
  19. dhevganx

    dhevganx Ads.id Fan

    Joined:
    Dec 8, 2009
    Messages:
    235
    Likes Received:
    53
    Location:
    A43T
    jadi pengen oprek scrap2an gini setelah dikasitau cluenya .. hehehe.. biasanya cuman parsing2an xml ala jevuska, jadi bnyk ide baru lewat scrap :v :v

    trims mas
     
  20. mp3online

    mp3online Super Hero

    Joined:
    Jul 19, 2011
    Messages:
    2,228
    Likes Received:
    294
    Location:
    jakarta
    kalau hosting berbayar sih biasanya file get content, curl & socket diijinin smua.
    kalau hosting gratisan byet gak bisa pake file get content tapi bisa pake curl & socket.
    snoopy pakenya fsockopen, di wp juga udah include snoopy nya tinggal makai.



    metode scrap ada banyak boss, ada lagi yg dom ditambah xpath.
    kalau yg pake xpath aku blm begitu mudeng :)

    ada jg php class html2array dll
     

Share This Page