Find Jobs
Hire Freelancers

RSS Feed Extractor

$10-10000 USD

Terminado
Publicado hace alrededor de 10 años

$10-10000 USD

Pagado a la entrega
A Utility to Find All RSS Feeds in a Domain P9 Project General Information: P9 is the code name for an information based internet and social media start-up that has developed and proved the alpha version of the service and is now improving and adding components for the launch version. There are many components for this service and we will put certain definable components out for bid. We may also invite people who have completed these components to join the project on a permanent basis. Job Overview: Create a utility that identifies and extracts all of the RSS or similar feeds in a domain. Job: P9-Data Input Component-RSS feeds module-Extraction from a URL Technology: • Java 7 (not Java 8). • See Technology requirements below. If the design of the program requires additional software developed by third parties, the software must be open source and we must approve of the license(s) for any such software used in the program. Description of Work: Create a program that processes a list of URLs and finds for each URL all RSS, Atom and similar feeds reachable from the given domain but only in pages in the given domain. (In this document, “RSS” refers to RSS, Atom and any other similar feed/link that provides news story data). All classes shall reside in the package com.c2g2.p9.rss.web.crawler. In this package, provide a class called Main to drive the process, you can organize the code in this package as you see fit. Main takes the following command line arguments: -input a URL to the input CSV file (required) -output a URL to the output CSV file (required) -fail a URL to the failure output CSV file (required) -help prints command line help (optional) For each URL in the input file, the program crawls the whole site and looks for all RSS URLs. If there is a problem processing a URL from the input file, log an error, and continue to the next record. Unless you can prove otherwise, each URL that the program finds has to be read and fed to a RSS parser to determine if the link reads a valid RSS feed. The program can take shortcuts, for example, URLs ending in .html, .gif, .jsp and so on can be skipped. The list of file extensions to skip should be places in a [login to view URL] file in the class path and contain for example: skip-ext-list = .jsp .html .html .gif .jsp .png .jpeg .asp .aspx The program should smartly cache already visited URLs for a given domain to avoid fetching a URL over and over again. When the program finds a valid RSS feed, it should write an output CSV record with the following columns: 1. URL from input file 2. ISO-8601 timestamp in the combined date and time with UTC format. For example: 2014-03-21T00:47:06Z 3. RSS feed URL The column names are: 1. Domain 2. Timestamp 3. RSS When the program cannot reach a domain from the input file, it should write an output CSV record with the following columns: 1. URL from input file 2. ISO-8601 timestamp in the combined date and time with UTC format. For example: 2014-03-21T00:47:06Z The column names are: 1. Domain 2. Timestamp SEE ATTACHED DOCUMENT
ID del proyecto: 5717636

Información sobre el proyecto

13 propuestas
Proyecto remoto
Activo hace 10 años

¿Buscas ganar dinero?

Beneficios de presentar ofertas en Freelancer

Fija tu plazo y presupuesto
Cobra por tu trabajo
Describe tu propuesta
Es gratis registrarse y presentar ofertas en los trabajos
Adjudicado a:
Avatar del usuario
I have put in a reduced bid now. Normally prices could have been lowered further, but since you are asking for unit tests, PMD and Checkstyle, etc. I have quoting a slightly higher price. About me: I have about 15+ years of experience in Java and Web technologies. My complete profile is as available on LinkedIn: [login to view URL] If you wish, do reach out to me on Skype: [login to view URL] or email: neelancer at outlook dot com. Looking forward to hearing from you.
$300 USD en 7 días
4,9 (15 comentarios)
5,2
5,2
13 freelancers están ofertando un promedio de $5.414 USD por este trabajo
Avatar del usuario
Hello , We have a team of Skilled Java-J2EE professionals with experience up to 8 yrs. You will be able to directly communicate with our technical expert. Our Java Expertise: 1) Desktop Applications: Swing, Eclipse Rich Client Platform, AWT, SWT, RMI 2) Frameworks: Core java, Advance java, Spring Core, Hibernate Core 3) Tools: JNDI, Xml, Java Mail, Java Applets, Java Web Start 4) Databases: SQL Server 2000/2005, MySQL 4.x/5.x, Oracle 8i/9i/10g/11g, Postgre SQL 8.2 5) Web Services: SOAP, WSDL, RESTFUL Web Services, Apache Axis 6) IDE: Eclipse, Net Beans, Web Ratio (Model based application development IDE) , Spring IDE 7) Source Control: CVS, SVN More details will be provided on request. By doing this work, we are interested in developing long term relationship by displaying our quality. Thanks for reading our proposal. Regards.
$515 USD en 12 días
4,9 (164 comentarios)
7,8
7,8
Avatar del usuario
Hi , I'm java software architect with more than 7 years experience. I have experience with different kind of project, from Standalone desktop application to web application with very great performance. For realization this project i'm plane to use "apache nutch" web crawler for search all RSS urls. All other stack of technology i will be take from your description below. The estimate for this project about 3 weeks. And i will be provide full scope of realization which you require. Also i have couple of question: What size of incoming cvs file do you plan to use? Do you need any indexing of proceeded URL from previews files? Do you need any indexing of URL which was processed in previous files? I can't attach the my cvs file to this proposal so if you answered to me i will be give it to your. This project is very interesting for me and I want to participate in it Feel free ask any question. Viktor
$8.842 USD en 24 días
5,0 (16 comentarios)
6,5
6,5
Avatar del usuario
Hello, I'm a Java, J2EE application developer with over 10 years experience. I'm very happy to work for you. Check my profile, see how other project owners said about my service. Contact me if you are interested. Thanks, Rick
$8.421 USD en 50 días
4,9 (20 comentarios)
5,9
5,9
Avatar del usuario
Can help... I am an Expert... Lets Start! Please start a Discussion with me and we can get started from there... Please check the past projects I have handled and check my reviews for what employers have to say about my work... Can start right now...
$8.000 USD en 30 días
4,8 (23 comentarios)
5,6
5,6
Avatar del usuario
Dear Buyer, I have over 5 years of enterprise Java experience and I can help you out with this problem. Sincerely, Erko
$2.998 USD en 15 días
5,0 (1 comentario)
2,1
2,1
Avatar del usuario
Dear Sir/Madam, I worked with several Multinational Companies for the last 8 years and have recently taken up freelancing as a career option for its obvious benefits of higher earnings and convenient work schedules. I can assure you of timely delivery, better quality and optimal performance with any of my works. I have worked extensively on web crawling and have been working on my own vertical-search-engine in my spare time; moreover I have built a Java library for processing RSS+ATOM feeds as a pet project some time back. So, you can be rest assured that your work is in safe hands which very well knew the nuances of web crawling and RSS/ATOM feeds before hand. You may find that i have no previous freelancing jobs done but as I said I earlier, I have quit my regular job to start freelancing only recently and hence may look a bit novice as far as freelancing is concerned. However, I promise to charge no more than the initial (minimum) payment if you are not satisfied with my work, and i am pretty sure you will hire me again once you see my work. You can see my profile to know more about me and my past work.
$5.555 USD en 30 días
0,0 (0 comentarios)
0,0
0,0
Avatar del usuario
Dear Sir, I'm quite interested by your project. I am programmer with sincere and capable. I have developed many project. Particular, I have rich experience in Java/J2EE/Jsp/Struts/Spring/Hibernate project. And I have experience in RSS Feed Extractor I can provide you the best results. If you had interest for my suggestion, please call me. My S.k.y.p.e id is "sweetdreamp201" I await for you. Best Regards.
$10.526 USD en 100 días
0,0 (0 comentarios)
0,0
0,0
Avatar del usuario
Hi I am export in web crawling and web scraping and I have done so many work related to this project. I want to do this project. I am looking forward to your response. Thanks Amit Ku Behera
$5.561 USD en 60 días
0,0 (0 comentarios)
0,0
0,0

Sobre este cliente

Bandera de UNITED STATES
Tuxedo, United States
3,3
6
Forma de pago verificada
Miembro desde mar 25, 2014

Verificación del cliente

¡Gracias! Te hemos enviado un enlace para reclamar tu crédito gratuito.
Algo salió mal al enviar tu correo electrónico. Por favor, intenta de nuevo.
Usuarios registrados Total de empleos publicados
Freelancer ® is a registered Trademark of Freelancer Technology Pty Limited (ACN 142 189 759)
Copyright © 2024 Freelancer Technology Pty Limited (ACN 142 189 759)
Cargando visualización previa
Permiso concedido para Geolocalización.
Tu sesión de acceso ha expirado y has sido desconectado. Por favor, inica sesión nuevamente.